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BRQADB2^ MID-IIETKORK SERVER 

The present invention relates to internetworked 
comiaunication systems, and especially (but not 
exclusively) to a highly scalable, broadband mid-network, 
server for performing mid-network processing functions 

5 including routing functions, per user processing, 

encryption, bandwidth distribution and traffic shaping. 
Background and Summary of the Invention 

As bandwidths within the core of the Internet increase, 
there is an increasing trend towards using the Internet 

10 Protocol {"IP") as the core network layer protocol for all 
kinds of traffic, including voice, video and data. 

Historically, quality of service on the Internet has been 
what is called "best effort," That is, the network attempts 
to transport as much traffic as possible, but if there is 

15 insufficient capacity to handle the traffic, all connections 
are equally likely to be influenced by congestion. Thus, 
"best effort" implies that the Internet provides only one 
class of service to any connection, and that all connections 
are handled equally with no priority. 
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In the case of traditional Internet applications, this 
approach was often sufficient. However, the intrinsic 
potential of the Internet is considerably greater, and 
includes new multimedia and interactive applications. Voice 
5 over IP ("VoIP") and Real Time Video are envisioned to be two 
significant applications for propelling Internet growth to the 
next level. VoIP can be defined as the ability to make 
telephone calls and send faxes over IP networks. The benefits 
of this technology are cost reduction, simplification, 

10 consolidation and advanced applications such as shared screens 
or whiteboarding which corabine voice and data. Real Time 
Video is a 'Mirect-to-user" technique in which a video signal 
is transmitted to the client device and presentation of the 
video begins after a short delay for data buffering, and 

15 eliminates the need for significant client-site storage 
capacity. It is also expected to become popular with 
businesses. Related to this is webconferencing, which 
requires high bandwidth since it is a continuous transfer of 
image information together with voice transfer. 

20 Webconferencing also requires real time traffic handling 
because it is usually implemented as an interactive 
application. 

All of these new applications will generally require 
significant bandwidth and/or reduced latencies. Bandwidth is 

25 the critical factor when large amotmts of information must be 
transferred within a reasonable time period. Latency is the 
minimum time elapsed between requesting and receiving data and 
is important in real-time and interactive applications such as 
webconferencing and telecommuting. Presently, most 

30 telecommuters depend upon analog modems with limited bandwidth 
and significant latency for dial-up connectivity. Even for 
today's applications, dialup connectivity is often inadequate. 

There are competing "last mile" technologies today which 
provide transport services to the user for delivering packets 

35 to the "edge" of the Internet. To complete the communication, 
the packets need to be formatted to allow them to enter the 
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Internet cloud and find their way to their respective 
destinations. The emergence of supporting protocols for new 
applications and the growth spurt in number of users and the 
required bandwidth per user results in a very dynamic access 
5 environment . 

The following is a summary of observations that pertain 
to an ideal mid-network point within the Internet; 

• In order to accommodate a variety of source packets, 
all the requisite protocols must be efficiently 

10 supported. 

• virtual Private Network services allow a private 
network to be configured within a public network. 
This is one of the drivers for Internet access amongst 
businesses. To allow Virtual Private Networks to 

15 coexist on the public Internet, and to encourage 

business use of the Internet, great care must be taken 
with respect to security and authentication issues, 
and tunneling protocols such as L2TP and IPSec must be 
efficiently supported, 

20 • The number of subscribers handled by one system and 

the different qualities of service provided will make 
service provider administration more complex. To make 
provisioning of broadband access more attractive to 
service providers, subscriber management and usage 

25 accounting must be simplified, and differentiated 

seirvices must be provided. 

• Broadband makes it possible to provide different 
amounts of bandwidth to users and to smaller Internet 
Service Providers. To make wholesaling of IP 

30 connectivity possible, and to simplify service and 

repair functions, the ability to support multiple 
service providers with one itiid-network server must be 
provided. 

• A large niimber of connections are serviced with a 
35 broadband mid-network server. In order to ensure that 

service is not interrupted, the broadband server must 



wo 01/67694 



PCTAJSOl/01003 



have very high availability. Such availability is 
also required for mission-critical business 
applications . 

• Central office co-location space is limited. To 

5 conserve this space, large connection densities must 

be provided. 

• When subscribers are allowed access at high speeds, it 
is possible for a limited number of users demanding 
disproportionate amounts of bandwidth to disrupt 

10 service for other customers. To ensure that large 

traffic bursts do not overload small client buffers, 
and to ensure that service providers and customers are 
treated fairly, traffic shaping must be provided. 

• To enable new value-added services, large bandwidths 
15 and low latencies are critical. 

In order to solve these and other needs in the art, the 
inventors hereof have succeeded at designing and developing a 
broadband mid-network server that, in the most preferred 
embodiment, satisfies all of the requirements described above. 

20 This inventive server provides reliable, secure, fast, 

flexible, high-bandwidth, and easily managed access to the 
Internet so as to accommodate all current Internet services 
including email, file transfer, web surfing and e-commerce, as 
well as the new value added services such as VoIP and Real 

25 Time Video. To meet these requirements, the broadband mid- 
network server of the present invention has been designed to 
scale not only in bandwidth, but also in processing power and 
state space. In the preferred embodiment, the architecture 
allows a service provider to configure the cards chosen for 

30 use in the available chassis space to suit his particular 
application. For example, to maximize processing power, a 
service provider could increase the number of IPE cards at the 
expense of a fewer number of line cards; as few as one line 
card. In the case of one line card, the maximum amount of 

35 processing power would be available to a service provider. IN 
the preferred embodiment described in detail below, this 
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configuration would provide 240 processors and 39 gigabytes of 
memory- This would allow for a greater number and complexity 
of value added services which require more processing power. 
Alternatively, a greater number of line cards could be 
5 selected for use in a chassis which would be desirable to 

handle greater traffic and throughput at the expense of fewer 
value added services. 

The high bandwidth core routers that are currently under 
development by third parties are optimized for performing 

10 large niombers of fast routing lookups, but are not expected to 
provide generalized and flexible computing power for 
supporting the s"ubstantial amount of processing needed for, 
among other things, per user and per packet processing. In 
contrast, the broadband mid-network server of the present 

15 invention includes the ability to distribute traffic across a 
number of Internet processing engines and, more specifically, 
across a number of protocol processing units provided in each 
engine (the bandwidth to which can be coordinated), to provide 
compute power and state space required for performing per user 

20 processing for a large number of users. 

One important feature of the present invention is a 
unique architectural philosophy, which provides that 
processing, be performed as close to the physical layer as 
warranted by considerations of flexibility, cost and 

25 complexity. This architectural philosophy maintains 
balance between two kinds of processing which are 
important to scaling bandwidth with value-added services 
in broadband networks: time-consuming, repetitive 
processing; and flexible processing which must be easy to 

30 program by third parties. The need for considerable 

time-consiaming repetitive processing, which has proved to 
create a bottleneck in the processor-based servers of the 
prior art, is addressed by the inventive architecture 
through specialized hardware, and results in dramatic 

35 increases in speed and decreases in delay. The need for 



wo 01/67694 



PCTAJSOl/01003 



flexible, easy to use, computing power to enable service 
providers to scale with value-added services is addressed 
by the inventive architecture preferably through the 
provision of high-performance general purpose processors 
5 which are paralleled and which can be scaled to a 
virtually limitless degree. Alternatively, network 
processors or digital signal processors or any other 
programmable processor could be utilized as well. 
Accordingly, the broadband mid-network server of the 

10 present invention provides a system that is currently 

unrivalled in performance and which can become the prime 
mover of Internet services such as managed, secure VPNs, 
Voice over IP and Real Time Video. 

While some of the principal features and advantages 

15 of the present invention have been described above, a 
greater and more thorough appreciation of the invention 
may be attained by referring to the drawings and the 
detailed description of the preferred embodiments which 
follow. 

20 Brief Description of the Drawings 

Fig. 1 illustrates a single shelf broadband mid- 
network server according to one embodiment of the present 
invention; 

Fig. 2 is a functional block diagram of the 
25 preferred server shown in Fig. 1; 

Fig. 3 is a functional block diagram of an exemplary 
line card shown in Figs. 1 and 2; 

Fig. 4 is a functional block diagram of an exemplary 
IPE card shown in Figs. 1 and 2; 
30 Fig. 5 illustrates routed distribution to an IPE 

card; 

Fig. 6 illustrates the processing flow on an IPE 

card; 



* 
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Fig. 7 illustrates a protocol processing platform 

according to the present inventions- 
Fig. 8 is a functional block diagram of an exemplary 

buffer access controller; 
5 Fig. 9 illustrates the foxrmat of a cell received at 

an input to a BAG from a PIC; 

Fig. 10 is a functional block diagram of a preferred 

packet manager; 

Fig. 11 is an illustration of the deployment of a 
10 broad-band mid-network server at a Service Provider POP; 

Fig. 12 is an illustraion of the different kinds of 

links an ISP may want on a secure segment; 

Fig. 13 is an illustration of the system wide 

bandwidth distribution functions; 
15 Fig. 14 is an illustration of the multi-level 

policing and multi-level shaping .that occurs in the 

system; 

Fig. 15 is an illustration of router distribution, 
two level policing, routing and two level shaping; 
20 Fig. 16 is a functional block diagram of a preferred 

packet inspector; 

Fig. 17 is an illustration of the preferred 
Distributor Flow Unit; and 

Fig. 18 is a summary of the highlights of the DFU. 
25 Detailed Description of the Preferred Embodiments 

The mid-network processor of the present invention 
is preferably implemented in a single shelf system as 
shown generally in Fig. 1, and is indicated generally by 
reference character 300^ As shown in Fig. 1, the mid- 
30 network processor 300 is provided with a number of 

physical connection ("PHY") cards 302-316 through which 
packets may enter and exit the mid-network processor 300 
according to a particular communication protocol, as is 
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known in the art. For the preferred embodiment 
illustrated in Fig. 1, the mid-network processor 300 
supports the POS, ATM, and Gigabit Ethernet layer two 
protocols, although the mid-network processor may readily 
5 be configured to support additional protocols, as will be 
apparent- The PHY cards 302-316 are each associated with 
line cards 322-336, respectively, as shown in Fig. 1. As 
is well known in the art, each PHY card is media 
specific. In other words, each PHY card is provided with 

10 connectors and other components necessary to interface 
with the communication media connected thereto, and over 
which packets enter and exit the PHY card. Each line 
card is configured to process packets of the type 
received from its associated PHY card, as explained more 

15 fully below. 

The preferred mid-network processor 300 shown in 
Fig. 1 is also provided with a number of Internet 
Processing Engine ("IPE") cards 340-354, as well as two 
flash memory modules 360, 362 and four switch fabric 

20 modules 364-368. As appreciated by those skilled in the 
art, the. number of switch fabric cards required is a 
function of the switch fabric card design as well as the 
desired redundancy overall performance. Fig. 1 also 
illustrates a midplane 370 that is provided for 

25 interconnecting the various cards described above. The 
preferred mid-network processor 300 utilizes a card-based 
approach to facilitate maintenance and expansion of the 
mid-network processor 300, as necessary, but this is 
clearly not a limitation of the present invention. 

30 The manner in which packets are processed by the 

preferred mid-network processor 300 will now be described 
with reference to Fig. 2, which is a functional block 
diagram of the preferred mid-network processor 300 shown 
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in Fig. 1 (although, to simplify the illustration. Fig. 2 
does not show the PHY cards 310-316, the line cards 330- 
336 and the IPE cards 346-354 shown in Fig. 1). Packets 
enter the mid-network processor 300 via the PHY cards, as 
5 is known in the art. Each PHY card then delivers its 
packets to its associated line card through the midplane 
370. After performing initial processing of the packet, 
the line card delivers the packet again through the 
midplane to the switch fabric which, in turn, delivers 

10 the packet to one of the IPE cards for performing certain 
mid-network processing functions, such as routing 
functions, per user processing, encryption, and bandwidth 
distribution. After performing mid-network processing 
for the packet delivered thereto, the IPE card sends the 

15 packet back into the switch fabric, typically for 

delivery to one of the line cards for some additional 
processing before allowing the packet to exit the mid- 
network processor 300 through one of the PHY cards. In 
some cases, depending upon how the mid-network processor 

20 of the present invention is implemented, a single IPE 
card may be insufficient to complete the necessary mid- 
network processing functions for a packet delivered 
thereto. In this case, upon performing some processing, 
the IPE card will deliver the packet to another IPE card 

25 (rather than to one of the line cards) via the switch 
fabric for further processing. Thus, although a packet 
will typically be processed by only one IPE card, it is 
possible to process a packet in multiple IPE cards, if 
necessary. 

30 In this preferred embodiment, all of the line cards 

contain identical hardware, but are independently 
programmable. Likewise, all of the IPE cards contain 
identical hardware, but are independently programmable. 
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This contributes to the scalability and elegantly simple 
design of the preferred mid-network processor 300. 
Additional processing power can be provided to the mid- 
network processor by simply adding additional IPE cards. 
5 Similarly, additional users can be supported by the mid- 
network processor 300 by adding additional line cards and 
PHY cards, and perhaps additional IPE cards to provide 
additional processing for the newly added users, if 
necessary. 

10 The flash memory cards are provided for storing 

configuration data used by the IPE cards during system 
initialization. 

*^ Note that, as used herein, the term "packet" refers 
to any type of packet that enters or exits the mid- 
15 network processor 300, including packets input to the 
mid-network processor 300 in the form of cells (such as 
ATM cells) via an interleaved or non-interleaved cell 
stream. 

In general, each line card used in the preferred 
20 mid-network processor 300 performs a number of functions. 
Initially, the line card converts packets (possibly of 
varying lengths) delivered thereto into fixed length 
cells. In this preferred embodiment, each line card 
converts input packets (including packets represented by 
25 individual cells) into 64 byte cells- The line card then 
examines the stream of fixed length cells "on the fly" to 
obtain important control information, including the 
protocol encapsulation sequence for each packet and those 
portions of the packet which should be captured for 
30 processing. This control information is then used on the 
line card to reassemble the packet, and to format the 
reassembled packet into one of a limited number of 
protocol types that are supported by the IPE cards. 
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Thus, while any given line card can be configured to 
support packets having a number of protocol layers and 
protocol encapsulation sequences, the line card is 
configured to convert these packets into generally non- 
5 encapsulated packets (or, stated another way, into 
packets having an encapsulation sequence of one) of a 
type that is supported by each of the IPE cards. The 
line card then sends the reassembled and formatted packet 
into the switch fabric (in the form of contiguous fixed 

10 length cells) for delivery to one of the IPE cards that 
was designated by the line card for further processing 
that particular packet. 

Although the fixed length cells which comprise a 
packet are arranged back to back when the packet is 

15 delivered to the switch fabric by a line card, the cells 
may become interleaved with other cells destined for the 
same IPE card during the course of traversing the switch 
fabric. As a result, the cell stream provided by the 
switch fabric to any given IPE card may be an interleaved 

20 cell stream. Thus, the IPE card will first examine this 
cell stream "on the fly" (much like the cell stream 
examination conducted by the line cards, explained above) 
to ascertain important control information. The IPE card 
then processes this control information to perform 

25 routing look-ups and other mid--network processing 

functions for each packet delivered thereto. The control 
information is also used by the IPE card to reassemble 
each packet, and to format each packet according to the 
packet's destination interface. The IPE card then sends 

30 the reassembled and formatted packet back into the switch 
fabric in the form of contiguous fixed length cells for 
delivery to one of the line cards (or for delivery to 
another IPE card, in the case where additional mid- 
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network processing fvinctions must be performed for the 
packet in question) . 

As noted above, although the cells of any given 
packet may enter, the switch fabric in a back to back 
5 arrangement, these cells may become interleaved with 
other cells during the course of traversing the switch 
fabric. Thus, the stream of cells provided by the switch 
fabric to any given line card may be an interleaved cell 
stream. Accordingly, a line card will first examine this 

10 cell stream "on the fly" to ascertain important control 
infoinnation that will be used primarily to reassemble 
packets, and to format the reassembled packets for their 
destination interfaces. Additional processing of 
outbound packets is also conducted on the line card for 

15 PHY scheduling and bandwidth distribution purposes. 

. While the preferred mid-network processor 300 of the 
present invention has been described as delivering 
packets from a line card to an IPE card and then back to 
a line card (or to one or more additional IPE cards) , the 

20 mid-network processor 300 can also be configured to route 
cells arriving over an ATM interface on one line card 
through the switch fabric and directly to another line 
card ATM interface, and can therefore function as an ATM 
switch . 

25 Fig- 3 illustrates an exemplary line card 380 used 

in the preferred mid-network processor 300 of the present 
invention. As shown therein, the line card 380 
preferably includes an ingress side (i.e., the left half 
of Fig. 3) and an egress side (i.e., the right half of 

30 Fig. 3) . When packets are provided to the ingress side 
of the line card from the line card's associated PHY 
card, the packets are first provided to a packet 
inspector chip ("PIC") 400 which converts the packets 
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(which may already be represented by individual cells 
such as ATM cells) into fixed length cells. In this 
preferred embodiment, the fixed length cells are 64 byte 
cells that are 8 bytes wide and 8 bytes long. Thus, a 
5 "cell time," in the context of cells propagating within 
the preferred mid-network processor 300, corresponds to 8 
clock cycles, as appreciated by those skilled in the art. 
The PIC 400 then examines the stream of fixed length 
cells "on the fly" to identify the "classification" (that 

10 is, the protocol encapsulation sequence), capture matrix, 
and other control information for each packet (as 
described more fully in copending Application No. 
09/494,235 filed January 30, 2000 entitled "Device and 
Method for Packet Inspection, " the disclosure of which is 

15 incorporated herein by reference) . More specifically, 
the preferred PIC 400 generates a control cell for each 
examined cell of a packet, and each control cell 
represents the control information that has been 
determined thus far for the corresponding packet. Thus, 

20 the PIC 400 outputs both the stream of fixed length cells 
that was produced before this stream was examined "on the 
fly" therein, as well as corresponding control cells. As 
shown in Fig. 3, these control and data cells are then 
provided by the PIC 400 to four preferably identical 

25 buffer access controllers ("BACs") 402-408. Each BAG 

stores a different quarter (i.e., 25%) of the data cells 
received from the PIC 400 in its corresponding cell 
buffer ("CB") . 

Each control cell output by the PIC 400 also 

30 includes a protocol processing unit ("PPU") identifier 
which identifies a PPU associated with a particular BAG 
for processing that control cell. Note that each PPU, in 
this preferred embodiment, preferably comprises two 
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general purpose central processing units ("CPUs"), as 
shown in. Fig. 3. Alternatively, a PPU could comprise one 
or more network processors, digital signal processors, or 
any programmable processors. The BACs 402-408 each 
5 examine the PPU identifiers contained in the control 

cells delivered thereto over a bus by the PIC 400. When 
a BAG determines that the PPU identifier . in a particular 
control cell is identifying the PPU associated with that 
BAG, the BAG will forward the control cell to its 

10 associated PPU for processing, as described more fully 
below. Thus, while every BAG 402-408 in this preferred 
embodiment stores a quarter of every data cell in their 
associated cell buffers, each control cell output by the 
PIG 400 is acted on by only one BAG and its associated 

15 PPU. As a result of being so processed, the size of the 
control cell is much smaller than the typical size of a - 
packet. This can significantly increase the utilization 
of the processor by reducing the I/O bandwidth which is 
the typical limiting factor in processor use. In this 

20 preferred embodiment, all control cells corresponding to 
a specific packet (and, more generally, to a specific 
user) are processed by the same BAG PPU on the line card 
380. 

Note that the PPU assigned by the PIG 400 for any 
25 given packet is performed according to configuration and 
control information received by the PIG 400 from a master 
PPU ("MPPU") 410, and can be changed by the MPPU 410 over 
time as necessary for PPU load balancing on the line card 
380. 

30 The PIC 400 also keeps track of the available memory 

addresses in the cell buffers associated with the BACs 
using a free buffer ("FB") list 412, and also keeps track 
of where each data cell is stored in the cell buffers 
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with respect to other cells of the same packet using a 
link list 414. 

When a control cell is processed within a particular 
BAG PPU, the PPD produces a new control cell to be 
5 provided to a packet manager ("PM") 420 which is in 
communication with the PIC 400 and the BACs 402-408. 
Included in this control cell provided to the PM 420 is a 
dequeue pointer which designates the location of the 
first cell of a packet that is to be dequeued and sent to 

10 the PM 420 along with the second and subsequent cells of 
that packet (if applicable) . The packet manager 420 then 
forwards this dequeue pointer back to the PIC 400, which, 
in turn, provides instructions to the BACs 402-408 to 
dequeue each quarter cell of the designed packet in 

15 sequence using the information previously stored by the 
PIC 400 in the link list 414. Thus, the designed packet, 
is reassembled as it is dequeued and delivered to the 
packet manager 420. 

At this point in the processing, the packet manager 

20 420 stores the cells of the reassembled packet in its own 
cell buffer 422 (using a free buffer list 424 and link 
list 426) . The packet manager 420 processes the control 
information it received for that packet from one of the 
BAC PPUs and then formats the packet according to this 

25 control information by modifying or augmenting the packet 
header as the cells of the packet are dequeued from the 
cell buffer 422. This process and additional details of 
the preferred packet manager 420 are described more fully 
in copending Application No. 09/494,236 filed January 30, 

30 2000 entitled "Device and Method for Packet Formatting," 
the disclosure of which is incorporated herein by 
reference. The packet manager 420 also appends a header 
to each of the 64 byte cells that constitute the 
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reassembled and formatted packet, and these headers will 
be used by the switch fabric for routing the cells 
therethrough. The packet manager 420 then forwards the 
cells of the packet in sequence to a DDASL 430, which is 
5 provided for managing cell traffic into and out of the 
switch fabric for the line card 380. Typically, the 
UDASL 430 then forwards the packet cells into the switch 
fabric for delivery to an IPE card that will perform mid- 
network processing functions for the packet in question. 

10 This IPE card is preferably designated by the BAG PPU 
that prepared and forwarded control information to the 
packet manager 420. 

Also shown in Fig. 3 is a 9-port Ethernet switch 450 
which provides for interprocessor communications between 

15 the eight FPUs on the line card 380 (i.e., 4 FPUs on the 
ingress side and 4 PPUs on the egress side) and the MPPU 
410 for purposes of load balancing, hardware monitoring 
and bandwidth distribution, and for sharing user and 
configuration information. The bandwidth distribution 

20 process and the preferred hardware are described more 
fully in copending Application No. 09/515,028 filed 
February 29, 2000 entitled "Method and Device for 
Distributing Bandwidth," the disclosure of which is 
incorporated herein by reference. 

25 Fig- 4 illustrates an exemplary IPE card 500 used in 

the. preferred mid-network processor 300 of the present 
invention. The hardware layout of the IPE card 500 is 
similar to the hardware layout on the ingress side (and 
the egress side) of the line card 380 shown in Fig. 3. 

30 That is, the IPE card 500 is also provided with a UDASL 

501 that delivers a typically interleaved cell stream 
received from the switch fabric to a PIC 502. The PIC 

502 is in communication with four BACs 504-510 that are 
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in communication with a PM 512. Thus, the primary 
difference between the preferred IPE card 500 and either 
side of the preferred line card 380 is the processing 
that is performed therein, even though this processing is 
5 performed with similar hardware- It should thus be 
apparent that the present invention provides, amongst 
other things, an inventive hardware module that can be 
programmed to perform requisite processing either on the 
ingress side or the egress side of a line card, or on an 

10 IPE card- This contributes to the configurability and 
scalability of the preferred mid-network processor 300, 
which can be reconfigured as necessary (both through 
programming and/or by adding additional lines cards 
and/or IPE cards) to accommodate additional users and/or 

15 to provide additional processing power. 

Much like the PIC 400 resident on the ingress side 
of the preferred line card 380, the PIC 502 provided on 
the preferred IPE card 500 is used to inspect the stream 
of fixed length cells provided thereto by the switch 

20 fabric "on the fly" to ascertain control information for 
each packet to be processed on the IPE card. In most 
cases, this control information was added to the packet 
by the PM 420 on the ingress side of the line card that 
forwarded the packet to this particular IPE card. The 

25 PIC 502 outputs the stream of data cells to the four BACs 
504-510, each of which is configured to store a different 
quarter of each data cell in its corresponding cell 
buffer (note that each BAG on the preferred IPE card 500 
has two PPUs associated therewith, whereas only one PPU 

30 is associated with each BAG on the preferred line card 

380) . The PIC 502 also outputs control cells to the BACs 
504-510, where each control cell contains a. PPU 
identifier that designates one of the two PPUs associated 
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with a particular BAG for processing that control cell on 
the IPE card to perform mid-network processing functions 
for the corresponding packet. In this preferred 
.embodiment, all control cells corresponding to a specific 
5 packet (and, more generally, to a specific user) are 
processed by the same BAG PPU on the IPE card 500. 

For any given packet, the PPU that processed control 
information for that packet on the ingress side of the 
line card is also responsible for determining to which 

10 IPE card and, more specifically, to which PPU on a 
particular IPE card, the packet should be sent for 
further processing. 

After a BAG PPU on the IPE card processes the 
control information for a particular packet, the PPU 

15 sends a control cell back to the PM 512, which then 

cooperates with the PIC 502 to dequeue the quarter cells 
of that packet in sequence from the cell buffers 
associated with the BACs 504-510. Upon receiving the 
constituent cells of a reassembled packet and storing 

20 these cells in its own cell buffer 514 (using a link list 
516 and a free buffer list 518), the PM 512 processes the 
control cell received from the BAG PPU to format the 
reassembled packet according to its destination interface 
before forwarding the reassembled formatted packet back 

25 into the switch fabric for delivery to its destination 
line card (or another IPE card, in the case where 
additional processing of the packet is required) . 

Also shown in Fig. 4 is a 9-port Ethernet switch 550 
which, like the Ethernet switch provided on the preferred 

30 line card 380, provides for interprpcessor communications 
between the eight PPUs and an MPPU 530 on the IPE card 
500 for purposes of load balancing, hardware monitoring 
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and bandwidth distribution, and for sharing user and 
configuration information. 

Referring again to Fig. 3, it can be seen that the 
egress side af the exemplary line card 380 is also 
5 provided with a PIC 600, four BACs 602-608, and a PM 610. 
Upon receiving a possibly interleaved stream of fixed 
length cells from the switch fabric via the ODASL 430, 
the PIC 600 examines this cell stream "on the fly" to 
ascertain control information (including control 

10 information that may have been added to the packet header 
by the PM 512 on an exemplary IPE card 500) . The PIC 600 
then forwards the data cells to the BACs 602-608 for 
storage in their corresponding cell buffers, and forwards 
corresponding control cells for each packet to one of the 

15 BAC PPUs' (typically assigned by an IPE card BAC PPU that 
previously processed control information for the same 
packet) for further processing. The assigned BAC PPU 
then performs additional packet processing, primarily for 
traffic shaping, PHY card scheduling and bandwidth 

20 distribution on that PHY card. This process and the 

preferred hardware are described more fully in copending 
Application No. 09/511,059 filed February 23, 2000 
entitled Method and Device for Data Traffic Shaping," 
the disclosure of which is incorporated herein by 

25 reference. Upon processing the control information 
received from the PIC 600, this BAC PPU produces and 
forwards a control cell to the packet manager 610, which, 
in turn, dequeues the quarter cells of the corresponding 
packet in sequence from the cell buffers associated with 

30 the BACs 602-608 in cooperation with the PIC 600. The PM 
610 then stores the constituent cells of the reassembled 
packet in its own cell buffer 612 (using a link list 614 
and a free buffer list 616) , and formats the packet for 
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its intended destination before forwarding the 
reassembled formatted packet to the PHY card associated 
with this line card for outputting the packet from the 
mid-network processor 300. 
5 A description of one preferred implementation of the 

broadband mid-network server described above will now' be 
provided, wherein the following terms have the following 
meanings : 

Cardid: An 8 bit number that uniquely identifies an IPE 

10 or Line Card in the system. 

Flowld: A 10 bit number whose lower (least significant) 
8 bits contain a Cardid, and whose upper (most 
significant) 2 bits identify the priority (class) of the 
traffic sent through the switch fabric to this card using 

15 this Flowld. (In the switch fabric, this field is 12 

bits, but our implementation only uses the least 
significant 10 bits.) 

User: A datalink (layer 2) interface. Examples include 
ATM virtual circuits, PPP sessions (over SONET, Ethernet, 

20 or ATM), and MPLS label switched paths. 

Userld: A 32-bit value that can be used as a system-wide 
pointer to user configuration and state information. 
Since multiple cards (one or more IPEs and one Line Card) 
can store information about a user, it is possible to 

25 have multiple Userlds that refer to a single user. The 

upper (most significant) 8 bits of the value represent 
the Cardid of the card which contains the user 
information being identified. The next 4 bits represent 
the PPUID of the PPU on the card where the information is 

30 stored, and the lower (least significant) 20 bits 

represent the CID assigned by that card to the user. The 
CID is used as an index into the PPD's table of user 
information . 

LCUserld: A Userld in which the Cardid identifies a Line 
35 Card. 
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Primary Userld: A Oserld in which the Cardid and PPUID 
identify the PPD on an IPE with has the primary 
responsibility for managing a user. 

Secondary Userld: A Oserld in which the Cardid and PPUID 
5 identify an IPE PPU other than the one identified by the 

Primary Dserld 

Small User: A user whose ingress packet stream is 
processed entirely by a single IPE PPD. Small users do 
not have Secondary Userlds. 
10 Large User: A user whose configured bandwidth is too 

high for his ingress packet stream to be processed by a 
single IPE PPU. All large users have one or more 
Secondary Userlds . 

Logical Link: A group of users of the same type (i.e.: a 
15 group of ATM Virtual Circuits) . If the Logical Link is a 

group of PPPoE sessions over ATM, the Logical Link must 
be an ATM Virtual Circuit. 

CSIX Header: The header of a CSIX (i.e.. Common Switch 
Interface) cell- The CSIX Header is separate from the 64 
20 byte cell payload. 

Cell Header: The first two bytes of the 64 byte payload 
of a CSIX cell. 

PIE Header: The 6 bytes immediately following the 
Cell Header of the first cell of a packet. 
25 Overview : 

In this particular implementation, the server system 
preferably comprises one or more rack mountable system 
units (i.e., shelves). The system also contains at least 
one line card, exactly as many PHY cards as line cards, 

30 and at least as many IPE cards as line cards. Also, each 
shelf of the system contains preferably three switch 
fabric cards and two flash disk cards. Each line card is 
uniquely associated with a particular PHY card. However, 
there is no particular association between line cards and 

35 IPE cards. 
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Each IPE card can be thought of as an independent router, 
with one or more IP addresses associated with it. Each Layer 
2 (datalink) interface (referred to as a "user") provided by a 
line card is associated with exactly one IPE card (more 
5 specifically, exactly one PPU on one IPE card) - Different 

users from the same line card can be associated with different 
PPUs on different IPE cards, and a particular PPU can have 
users from multiple line cards. 

Since it is possible for multiple Layer 2 protocols to be 

10 encapsulated within each other (for example, 

PPP/Ethernet/ATM) , there is an exception to the ^^one user, one 
PPU" rule. In this case, the inner-most levels of 
encapsulation, each of which being layer 2 interfaces (users) 
in their own right, can be associated with different PPUs 

15 within an IPE card, or even PPUs on different IPE cards, thus 
causing traffic from the outer levels of encapsulation to be 
split among multiple PPUs or IPE cards. It is also possible 
for outer layers to be encapsulated layer 3 traffic as well as 
layer 2 traffic (for example, an Ethernet /ATM virtual circuit 

20 can carry IP as well as PPPoE packets). In this case, all the 
layer 3 traffic will be associated with a single PPU (a user) , 
but the encapsulated layer 2 datalinks (users) can each be 
associated with a different IPE card. 

The set of all users on the system is preferably 

25 distributed as evenly as possible across all the IPE cards in 
the system. Within an IPE, the MPPU stores the per-user 
information for the users assigned to that IPE and distributes 
those users across its PPUs . Each PPU stores a copy of the 
per-user information assigned to it. Thus each user is 

30 associated with one and only one IPE card and one and only one 
PPU on that IPE. This PPU's copy of the user's configuration 
and state information can be uniquely identified on a system- 
wide basis by the Primary User Id. 

The architecture of this preferred implementation is 

35 based on line cards, PHY cards, a switching fabric, internet 
processing engines (IPE) and flash memory modules, as was 
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described generally above. The line cards terminate the link 
protocol and distribute the received packets based on user, 
tunnel or logical link information to a particular IPE through 
the switching fabric. The procedure of forwarding a packet to 
5 a particular IPE and PPU will be denoted as "routed 

distribution." A luidplane is also used to connect the 
different cards. The preferred line card and the preferred 
IPE card were described above with reference to Figs. 3 and 4. 
The system is comprised of a set of hardware components, 

10 as described, which can be used to configure a system for a 
wide variety of applications as well as throughput 
requirements cost effectively. The preferred switch faibric 
and scheduler support cell switching at OC-192 speeds, and the 
switch fabric is both fully redundant and highly scalable. 

15 The preferred IPE cards have the following attributes: high 
performance protocol processing engine; manages users, tunnels 
and secure segment groups; supports policing and traffic 
shaping; implements highly sophisticated QoS with additional 
support for differentiated services; supports distributed 

20 bandwidth management processing; and supports distributed 

logical link management, able to do NAT, packet filtering and 
firewalls . 

The preferred line cards have the following attributes: 
packet lookup processing; protocol identification; scheduling; 

25 supports distributed bandwidth management processing; 

multi-I/F support (ATM, GE, POS) ; and AAL-5 Processing (CRC 
check and generation) . 

The preferred PHY cards have the following attributes: 
line termination for rates up to OC 192c; ATM - Layer 

30 Processing; ATM - SONET Mapping; POS - SONET Mapping 

(including EEC checksum computation) ; GE - MAC and PHY 
Processing; and support the following line cards: ATM: 4x OC- 
48, 8x OC-12, 16x OC-3; POS; lxOC192, 4x OC-48, 16x OC-12; and 
GE: 8/lOx GE. Additionally, the overall system preferably has 

35 the following attributes: high availability; 1+1 switch 
fabric and scheduler redundancy; 1+1 control system unit 
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redundancy; all field replaceable units are hot-swappable; N+1 
AC power supply redundancy; and N+1 fan redundancy. 

One purpose of routed distribution is to forward a packet 
to a particular PPD within an IPE. The key benefits of this 
5 approach are: incremental provisioning of compute power per 
packet; allows load distribution based on the packet 
computation needs for a particular user or tunnel; user and 
tunnel configuration information can be maintained by one 
single processor thus minimizing the inter-process 
10 communication needs; and allowing the portability of single 
processor application S/W onto the system. 

Fig. 5 illustrates the distribution of packets to a 
particular IPE. A packet is received from a line card. The 
line card exaiaines the packet and forwards the packet based on 
15 the IP source or destination address, the user session ID, or 
the tunnel ID. The IPE receives the packets and hands it over 
to the PPCJ specified by the line card. 

The line cards and the IPE host the flexible protocol- 
processing platform. This platform is comprised of a data path 
20 processing engine and the already mentioned protocol- 
processing unit. The separation of data path processing from 
protocol processing leads to the separation of memory and 
compute intensive applications from the flexible protocol 
processing requirements. A clearly defined interface in the 
25 form of dual-port memory modules and data structures 

containing protocol specific information allows the deployment 
of general-purpose CPU modules for supporting the ever 
changing requirements of packet forwarding based on multi- 
layer protocol layers. 
30 The protocol-processing platform can be configured for 

multiple purposes and environments. That is, it supports a 
variable number of general purpose CPOs which are used in the 
context of this architecture as Protocol Processing Units 
(PPD) . One of these CPUs is denoted as the Master Protocol 
35 Processing Unit (MPPU) . 
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The data path processing unit extracts, in the packet 
inspector, all necessary information from the received packets 
or cells and passes this information on to a selected PPU via 
one of the buffer access controller devices- The cells 

5 themselves are stored in the cell buffer and linked together 
as linked lists of cells, which form a packet. Once a PPD has 
selected a packet for transmission, it passes the pointer to 
the packet and the necessary formatting and routing 
information to the data path processing unit. This enables 

10 the formatting and the segmenting of the packet. The packet 
is then forwarded either as a whole or segmented based on the 
configured interface. 

Each PPD is associated with one dual-ported memory, where 
. one port is controlled by the data-path processing unit and 

15 the other by the corresponding PPD. Each dual-ported memory 
contains two ring buffers, where one ring buffer is used to 
forward protocol specific information from the data path to 
the PPD and the other is used for the other direction. The 
ring buffer for passing on protocol specific information to 

20 the PPD is called the receive buffer. The other buffer is 

called the send buffer. Two pointers are maintained for each 
ring buffer. The write pointer for the receive buffer is 
maintained by the data path processing unit while the read 
pointer is set by the PPD. The send buffer's write pointer is 

25 controlled by the PPD and the read buffer by the data path 
processing unit. 
The PHY Card ; 

The PHY card terminates the incoming transmission line. 
It also performs clock recovery and clock synthesis. Optical 

30 signals are converted into a parallel electrical signal which 
is then an input to a physical framer device which maps the 
incoming bit stream into the transmitted physical frame. 
Finally the physical layer of the corresponding link protocol 
processes the physical frames. In addition, link layer 

35 protocol processing is performed in order to provide a common 
packet interface to the line card. On the transmission side. 
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the packets or cells are mapped into physical frames. These 
frames are then encoded into the corresponding physical layer 
format and sent over the optical fiber to the receiving peer. 
The physical layer format is preferably either SONET or 
5 Gigabit Ethernet. The link layer format is preferably GE, ATM 
. or PPP for POS. 
The Line Card : 

The line card performs packet forwarding for the egress 
and ingress path. Full duplex 10 Gbit/s throughput is 

10 provided. The line' card interfaces to the PHY cards and the 
switch fabric card. The Line Card is preferably configured 
for either POS-PHY or UTOPIA III interface to the PHY card. 
The Line Card preferably hosts two Protocol Internet Engine 
(PIE) chip sets. On the ingress side, one PIE chip set 

15 supports four protocol-processing units (PPU) and one MPPU. 
The Four PPUs perform routed distribution to the various IPEs 
. in the system. They also provide traffic shaping and 
scheduling of flows to the switching fabric. The remaining 
MPPU is used for overall control and supports the distributed 

20 bandwidth allocation protocol of the- switching fabric. 

The Packet Inspector (PI) first examines incoming cells 
or packets and the protocol information is extracted based on 
matched patterns in the data flow. This information is then 
made available to the PPU which is responsible for processing 

25 the incoming packet. Cells or packets from a PHY card are 

processed by a particular PPU based on a chosen configuration. 
This configuration depends upon the configuration of the PHY 
card itself and upon the protocol supported by the PHY card. 
The other PIE chip set, processing the egress flow, is 

30 preferably responsible for cell, assembly from the switch 
fabric and packet scheduling for multiple physical ports. 
Additional support for AAL5 processing is provided for ATM 
flows. The MPPU from the ingress path is shared for 
configuration, maintenance and cell extraction of the egress 

35 flow. 
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The communication channel provides signaling and 
connection setup control for the ATM PHY card. The PHY card 
informs the Line Card about the physical layer status and 
reports alarm and error conditions. 
5 The ingress packet processing preferably involves: 

Packet Assembly for ATM traffic (AAL5 processing) ; Protocol 
Identification (Packet Data Inspection); Routed Distribution; 
Scheduling of traffic flows through switching fabric; Buffer 
management for ingress cell buffers; and cell scheduling for 

10 the switch fabric. 

The egress packet processing preferably involves: 
Traffic Shaping; Packet Assembly for switch fabric flow; MPHY 
Buffering; Cell Scheduling for ATM with multiple physical 
interfaces with AAL5 processing (CPCS, SAR) ; and Packet 

15 Scheduling for POS .with multiple physical interfaces. 

The Internet Processing Engine (IPE) provides the 
functionality for protocol processing, user management, tunnel 
management and secure segmentation. It receives the packets 
from the switching/ enforces the service level agreements 

20 (SLA's), performs packet classification, filtering and 

forwarding, and finally schedules the packet for transmission 
to the requested interface. 

The PI is part of the Packet Internet Engine (PIE) chip 
set, which consists of the Packet Inspector^ the Buffer Access 

25 Controller, and the Packet Manager. Together with the sixteen 
PPas and the MPPU, the PIE chip set provides a powerful 
Protocol Processing unit. The PIE chip extracts informative 
protocol information and forwards it to the PPUs and the MPPO 
based on the routed distribution decision made in the Line 

30 Cards. The chosen PPD processes this information and performs 
all necessary packet processing. This includes, besides 
forwarding and filtering, policing, and packet formatting. 
The MPPU controls the IPE and is negotiating with the units in 
the system the bandwidth allocation of the switch fabric. It 

35 also provides bandwidth management for the configured logical 
links. The MPPU manages its connections by assigning users and 
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tunnels to individual PPOs for forwarding processing from the 
Line Card to a particular IPE. Once a connection between the 
MPPD and Line Card is set up, all packets belonging to such a 
connection are forwarded from the Line-Card to the chosen PPU. 
5 A PPU is chosen based on the already assigned 

connections, their bandwidth and the bandwidth and QoS 
required for the new connection. Connectionless traffic 
(Internet to Internet) is mapped onto an internal connection. 
If more bandwidth is needed than one PPU can manage, the 

10 packets will be distributed over multiple PPUs. 

The functionality of the IPEs include: User Managements- 
Tunnel- Management; Logical Link Management; Support for Secure 
Segmentation; Policing; QoS Control with Diff Service Support; 
Buffer Management; IPv4, IPv6 Forwarding; Packet 

15 Classification; Packet Filtering with support for user 

Filters; Celox Management Database Support; Packet Formatting, 
and NAT. 

The Protocol Internet Engine Chip Set (PIE) : 

The Protocol Internet Engine (PIE) provides the data path 
20 processing capabilities for the server system at OC-192c 
rates. The PIE chip set comprises three chips. These chips 
result in a very high performance packet processing system 
together with an interface contftoller and multiple general 
purpose CPUs. 

25 Each cell is preferably transferred into the buffer 

through four buffer access controllers ("BACs") in order to 
increase. the bandwidth to the PPUs and to increase the 
bandwidth to the external cell buffers. Different portions of 
the same cell are written to the cell buffers attached to the 

30 different BACs. However, the captured portion of the data is 
sent to just one of the PPUs. 

The preferred BAC unit is shown in Fig. 8. The RSU 
receives incoming data, reformats the data ta an internal 
format, performs a parity check for incoming data, and also 

35 performs synchronization control. The preferred format of a 

cell received by the BAC from the packet inspector is shown in 
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Fig. 9. Referring again to Fig. 8, the Cell Filter unit 
extracts control information from the cell and sends the cell 
data to the BAU along with the indication of which portion of 
the cell has to be stored in this cell buffer. The CFO also 
5 sends the cell data stream to the PTU which translates the 
PPUID to the appropriate FPU and thence to the CCQ where, 
based on the PPUID and the capture matrix, the control cell is 
extracted from the data cell CCU and stored in the CBD. The 
CMU then transmits the control cell to the appropriate FPUs 

10 through a dual port RAM interface. 

When the packet has to be dequeued, the control cell 
corresponding to the packet is sent by the PPU which processed 
that user to the PM along with the dequeue pointer. This is 
received by the BEU of the PM, as shown in Figure 8. 

15 The control cell data stream (shown as the narrow arrow 

in Fig. 8) then goes to the ICO where it is stored while the 
DSU does deficit round robin scheduling of the data packets 
corresponding to the control packets in order to distribute 
bandwidth equitably to the BACs for sending out packets- In 

20 addition, the dequeue pointer corresponding to the packet to 
be dequeued is sent to the PID from where it is transmitted to 
the PI where it is received at the PIU and passed on to the 
BMU. In the BMU, the dequeue pointers are stored in a FIFO 
while the previous packets are being dequeued. The dequeue 

25 pointer information is passed onto the BACs and the BAU in the 
BACs. dequeues the packet and passes it through the PMU to the 
packet manager. A packet is dequeued by dequeuing all the 
cells comprising the packet which are held in the form of a 
linked list. Data packets from the data packet stream (shown 

30 as the thick arrow in Fig. 8) undergo AAL5 processing (should 
they need it) in the APU, and are stored in the IDD buffer. 
The FAD reformats packets into 64 bit slices and controls 
dequeuing from both the IDD and the DSU's DPRAM in accordance 
with the PFD- In order to ensure matching of the control 

35 packet with the data packet, a sequence number is used at the 
beginning of both the data and the control cells. Both the 
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control and data streams enter the PFD where they are 
formatted and sent to the TID to be sent to the phy cards or 
the switch fabric. 

The PIE chip set can be configured for multiple purposes 
5 and environments. That is, it supports a variable number of 
general purpose CPUs which are used in the context with the 
PIE chip set as Protocol Processing Units (PPD) . One of these 
CPUs is reserved for maintenance and control purposes and is 
denoted as MPPU. 

10 The PIE chip set implements all necessary functions in 

order to hide all data path processing from the actual 
protocol processing functionality. The PIE chip set extracts 
all necessary information from the received packets or cells 
and passes this information on to a selected PPD. The cells 

15 are then stored in the cell buffer and linked together as 

linked lists of cells, which form a packet. Once the PPU has 
selected a packet for transmission, it passes the pointer to 
the packet and the necessary formatting and routing 
information to the PIE chip set. This allows formatting and 

20 segmenting of the packet. The packet is then forwarded to the 
MPHY scheduler as a whole or segmented based on the configured 
interface . 

Each PIE chip set is differently configured. The PIE chip 
set on the IPE supports as many as 8 PPUs and 1 MPPU. 4 PPDs 
25 and 1 MPPU will support the PIE chip set on the ingress side 
of the Line Card, and an ecjual number on the egress side of 
the Line Card. 

The characteristics of the preferred PIE are as follows: 
Three Chip Chip-Set; Full Data-path processing in hardware; 

30 Support for distributed .protocol processing by general purpose 
CPU modules; Highly scalable compute power per packet (up to 
64 PPUs can be supported) ; Flexible interface support with 
MPHY scheduling; AAL-5 Processing; SAR Sublayer: Assembly and 
Segmentation for up to 256K connections; CPCS Sublayer: CRC 32 

35 generation and check, padding control, and length field 

control; Internal Packet Processing; Checksum computation and 
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check; Length field control; Padding control; Micro- 
programmable Packet Inspection Engine; Supports any layer 
packet inspection; Supports byte matched pattern processing; 
Supports bit matched pattern processing; Results are made 
5 available to protocol processing units; Supports extraction of 
any portion of packet for protocol processing; IPv4/IPv6 
Header Checksum; Congestion Avoidance Support; EPD; PPD; 
Internal Back-pressure control; Linked List Control; Supports 
up to 8 million 64 byte cells (initially a million) ; Links 

10 cells together to form a packet; Garbage Collection; Assembly 
aging control; Buffer Access; Parity generation and check for 
signal integrity; Cyclic access for data rates up to 12Gbit/s; 
PPD Access; Dual-Port access control for up to 8 dual-port 
RAMS each with 512/256 KByte memory; Support for dual-port RAM 

15 data synchronization; Dual-ring buffer control for each dual- 
port RAM for data exchange; Threshold-based access control for 
writes to ring buffer; Support for up to 24 Gb/s throughput 
(bi-directional) ; Back-pressuring in case of buffer overflow; 
Cyclic Packet Scheduling; Packet Scheduling for cyclic access 

20 control with support for data rates up to 12Gb/s; Micro- 
programmable packet formatter; Supports insertion, removal and 
overwriting for any byte in a packet at OC-192 speeds; 
Supports IPv4/IPv6 Header Checksum generation; Support UDP/TCP 
checksum generation; Cell Scheduling, Buffering and Linked 

25 List Management; Supports cell buffering for up 512K cells; 
and supports scheduling for up to 1024 queues. 

Together with the PPOs, the preferred PIE supports: 
Packet Classification : 

Based on Layer 3,4, — Information (any layer); Packet 
30 Filtering; User programmable filters; Group filters; Firewall 

processing; Packet Forwarding; IPv4 Lookup Processing; IPv6 

Lookup Processing; Tunnel Forwarding; Buffer Management; 

Dynamic Thresholding on a per user and assigned rate basis; 

Support for up to 8 million Cell Buffer (initially a million) ; 
35 Congestion avoidance with Early Packet Discard (EPD) ^ Partial 

Packet Discard (PPD), Selective Packet Discard; Policing; Per 
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User and Logical Link; Enforcing traffic contracts based on 
SLA; Traffic Shaping; Per User and Logical Link; Support for 
traffic contracts based on SLAs; Support for Real-^time traffic 
(low delay traffic) ; QoS Control; Supported for differentiated 
5 services; Multiple priorities per user; Flow based queuing 
(not initially supported); Bandwidth Management; Distributed 
processing for allocation of bandwidth on switch-fabric links 
including MPHY links; Distributed processing for allocation of 
bandwidth for logical link management; User Management for up 
10 to 512K users; Tunnel Management for up to 128K users; L2TP; 
IPSec; Multi Protocol Processing; and Support for any 
protocol • 

Traffic Management : 

Traffic Management for an Internet access system is 

15 complex due to the involvement of various system interfaces. 

A system might be connected to users, the Internet backbone, a 
Local Area Network with file and Web servers, and a 
Metropolitan Area Network (MAN) which gives access to local TV 
and media servers as shown in Fig. 11. Each link has 

20 . different link properties with respect to available bandwidth 
and Dollar per Megabyte. This means that a user's share of 
bandwidth on a particular link has to be based on the property 
of this link, A user might get more bandwidth share on the MAN 
link than on the backbone link due to the fact that more 

25 bandwidth at a cheaper price is available on the MAN link than 
on the backbone link. The same is true for bandwidth 
wholesaling of the preferred system to multiple ISPs who would 
like to resell bandwidth to their customers. The enabling 
technology for this model is Secure Segmentation. This model 

30 has also led to the introduction of logical link groups. A 
logical link group can be assigned to a secure segment based 
on the bandwidth needs of the considered secure segment for a 
particular link as shown in Fig. 12. This means that not only 
user allocation has to be considered but also logical link 

35. bandwidth needs to be included. Therefore, bandwidth is 

distributed based on traffic class, user, and logical link 
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group. This supports the wholesaling model and takes into 
account over-subscription requirements in order to support QoS 
including differentiated services. 

The preferred system represents a highly distributed 
5 system. In such a system, resources have to be allocated based 
on the requirements of the traffic of each component. That 
means in general that each component has to take part in a 
distributed computation method in order to allocate the 
resources. The traffic management requirements for bandwidth 

10 allocation within the preferred system will have to include 
bandwidth negotiation for the various flows through the 
switching fabric. One also has to consider the specific 
requirements to support the above-introduced concept of 
logical link groups. Since logical link groups are managed in 

15 a distributed manner, bandwidth information has to be 

exchanged between the entities managing one logical link as 
shown in Fig. 13 • Buffer management and QoS Control is an 
integral part of the overall traffic management scheme 
implemented in the preferred server system. Due to the large 

20 buffer, the system has to maintain on various different places 
in the distributed system a sophisticated buffer management 
scheme which has to be implemented and supported by QoS 
control in order to support differentiated services and other 
traffic flow specific requirements 

25 Policing - Traffic Shaping : 

Policing and Traffic Shaping have closely related 
functionality. Policing ensures that the incoming stream does 
confonn to the negotiated link parameters for a logical link 
group as well as the user of the incoming link. Traffic 

30 shaping enforces the link parameters for the outgoing traffic 
stream based on the outgoing user, the logical link group and 
the link itself. Fig. 14 is intended to illustrate the need 
for policing as well as traffic shaping. An incoming traffic 
stream is shaped (policed) in order to enforce the traffic 

35 contracts of a user for the considered link and logical link. 
Before the traffic is forwarded to another link, the traffic 
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contracts for this particular link have to be enforced. This 
traffic contract might be much different from the traffic 
contract of the incoming stream. Consider the case where a 
user requests information over the Internet backbone link. 
5 The bandwidth allocated on this link for this user might be 
500 Kbit/s. The logical link bandwidth for the corresponding 
secure segment might be set to 10 Mbit/s. If the user's 
access link to the system uses an ATM connection with an 
assigned rate 1 Mbit/s and no policing is enforced, the user 

10 could use the full 1 Mbit/s. This is possible since the 
traffic shaped onto the user ATM link allows the user to 
transmit the higher rate. Therefore, it is necessary to police 
the incoming traffic and the other for shaping the traffic for 
a particular link. 

15 Fig. 15 shows the schematic implementation of the policer 

and traffic shaper in an IPE within the preferred server 
system. A received cell is assigned to a particular user data 
structure assigned to the incoming link for the considered 
user. As discussed earlier, the policing information can be 

20 directly obtained from the user who is sending a packet based 
on the connection identifier, the corresponding session ID, or 
the IP source address- However, if the packet on the incoming 
connection cannot be directly associated with a user or 
logical link group, then the user and/or logical link group 

25 for whom it is destined classifies the packet. Based on the 
obtained user and logical link information the incoming 
traffic stream is policed by queuing up the packets and 
enforcing the negotiated traffic contract. 

Once the packet conforms to the incoming link 

30 requirements, the packet is shaped based on the user 

parameters and logical parameters for the outgoing link. These 
parameters are obtained from the user connection itself if a 
session ID can be associated with it. If the packet comes from 
a user and is forwarded across the Internet to a remote 

35 terminal, then the shaping parameters are obtained from the 
sending user for the corresponding link and the associated 
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logical link group. For connectionless traffic, which cannot 
be directly associated with users, logical link group can be 
assigned based on the IP destination address and or source 
address. This allows managing traffic flows between networks. 
5 Switch Fabric Bandwidth Management and Scheduling : 

In order to meet the QoS requirements of individual 
traffic flows and to ensure that delay requirements of certain 
flows can be met, sophisticated scheduling must be conducted 
across the entire switch fabric. This scheduling takes into 

10 account the allocated user bandwidth, logical link share, 
buffer occupancy for output queues, available sub--port 
bandwidth, priority of class of traffic, and expected delay. 
All this is accomplished while maintaining high throughput 
across the switch fabric. 

15 Attached hereto as Exhibit A are details of the 

manner in which the preferred server system is programmed 
so as to minimize inter-IPE card communications. 

There are various changes and modifications which 
may be made to the invention, as apparent to those 

20 skilled in the art. However, such changes and 

modifications are suggested by the present disclosure, 
and the invention should therefore be limited only by the 
scope of the claims appended hereto, and their legal 
equivalents. 



25 
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EXHmrTA 



1.1,1 Une Cards ^Ingress Side 



Line Cards do cot perfOTtn any traffic policing. Policing is performed, m distributed fashion, by the 
all IPE cards in the systeoL If during testing, it can be determined that the Line Cards have enough 
processor and I/O bandwiddi to peifonn policing, this fimction mi^ be moved to the Line Cards in a 
iutiirevczsion of the software. Also, Line Caids do not pedbnn any nmting table tooiki^js. 

All IP packets xeccived, rpgardless of their encapsulattc»i, must have their destinatron IP address 
. captured and examined by a LC PPU. One operation duU most be peifoimed is detenniiung if ^ 
destinaticm IP address is one of die IP addresses of our system. This can be done using a sinq>le hash 
tabk. A fiill QDR routing search is not necessary, since we are only looking for an exact matdt The 
result of the lodbq> ^ successful) is die Cardid of the IPE diat the address belongs to. If a matdi is 
fomid and the Cardid is equal to die Cardid of die IPE diat the packet is about to be forwarded to, die 
packet must be forwarded with die Destmatioa FPU bit set This is so diat when the padcet is 
received, die PI can select die padcet to be c^rtiued in its entirety (as kmg as it is not part of a non- 
eaciypted tnnnel). 

Additionally, if die padcet is an IPsec packet diat has been received fiom a large user^ and the 
destination IP address is one of the addresses of die IPE to whidi the packet is about to be forwarded 
to, die Usedd should be determined based on die IPsec Security Parameter Index (SPI) radier dian on 
die hash of the source and destination IP a^^resses in the P header of die padcet These opocations 
win be discussed in greater detail in die sectums diat folk) w. 

PIE Header 



«2 



FID 



T 



Dst 
PPU 



CP (wiHiPPU access infininarinn) 



Payload Length 




3S 




3S 
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lsl.2 ATM line Csnl 

Small Virtual Circuits 

The foUowing infonnation is sent to tiie IPE FPU along with die packet payload: 

• JnihcCSlX Header: 

• DestmadonFIowId: .in.. 

Sent in flie CSIX Header of eveiy ceU of tiie packet to identify where me switch fabric 
should send it as well as the priority (class) of Ihe packet 

• hiike Cell Header: 

« Source Flowld: . . t u 

Sent mfkeCdl Header to allow the IPE to reassemble the packet TTus is swaply the 
identification of the line card (in fee least significant 8 bits) and die priority (class) in the 
most significant two bits. The priority MUST be Ae same as is specified in the 
Destination Flowld of this packet 

• DiscaidBit: 

Set by die LC PM to inform the IPE diat an eiror (IP checksum, AAL5 CaiC, mtOTial 
parity) was detected in die packet 

• EOPBit 

Set by die LC PM to mdicate die last cell of die packet 

• Jnibi&PiB Header: 

• InitialPID: , ^ ^ t. * 

This 3 bit field tells die IPE die type of enc^jsuktionduspaclcet has- The choices are: 

IPC (for intcr-processor communications), IP. PPP, Ediemet, ATM, <Mr MPLS 

• InitialStage: ^ ^ ™ t. * 

Tliis 4 bit field can be used to give additional information to die IPE about tue 
encapsulation of diSs packet It specifies whidi stage in die IPE PI wiU be die first to 
inspect the packet 

• DestinationPPU(lbit): 

This bit is set for IP packets whose destination address is equal to one of die IP addresses 

of die IPE card diat packet is l>eing sent to. 
« Destination PPUID: 

The PPU idwitifier of die IPE PPU diat die packet is being sent to. 

• PECID: , . ^ . 

An mdex into die connection table of die BPE PPU diat die padcet is bemg sent to. 

The PI uses die W Wa and PHYTO to calcukte die IX: OD, wMA is used by die PI as m 

die baidwarc connection table. Tbe PI reads (amongst odier tilings) a LC PPUID which selects die 

LC PPU diat die control infonnation for die packet should be sent to. 

The LC CID is also used by die LC PPU as an index into a software connection table. Typically 
(dioueh not always), diis connection table is used to detennine die UserJd (which consists of a 
Dest^tion CardId, Destination PPUID. and IPE OD) diat is sent to die IPE m die PIE Header of &e 
packet As packets are inspected by die LC, a determination of priority (one of four classes) is m^ 
based on die protocols found in die packet. Alternatively, die priori^ could be read firan die 
connection table. Ibis priority is used to determine die two most agmficant bits of die Destination 
Flowld when the packet is forwarded to an IPE. 

The ATM ceU headeis and die AAL5 trailer and paddmg arc rraooved (by die PM) before fcowardmg 
die padcet 

Tbe following is a list of aH die different types of top level protocol encapsulation diat can be recdved 
by die ATM line card, along widi an explanation of die processmg diat must take place <m die Ime 
card for each type of protocol: 
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1.1 J2.1.i IP (RFC-1483) 

The BP packet is forwarded to the IPE. The IPE OD is detennined by reading the software 
coimectioii table. 

1.1.2.1.2 Ethernet (RFC-1483) 

Each ATM LC must have a standard globally unique Ethernet MAC address pamanently assigned to 
it Each Elhemet/ATM VC should be configurable as to whedier or not it is in "pronuscuous** mode - 
that is, whediCT or not it should discard unicast paclo^ not sent to its MAC address. 

All EOiemet packets are forwarded to the IPE witti Husk MAC headers intact, except for PPPoE 
session packets (ethertype = OxS864). For non-PPPoE session packets, the IPE CLD is determined by 
reading the software connecticm table, and the Initial PED is set to indicate an Ethernet packet 

For PPPoE session packets, the PPPoE header is removed, and die PPPoE Session ID (fiom tiie . 
PPPoB header) is used to index into a PPPoE session table, from whidi tiie IPE QD can be retrieved. 
In this case die Initial PID is set to indicate a PPP padcet Additionally, if the PPP/PPPoE protocol 
type is 1P« die PPP header is also removed before die packet is forwarded to die IPE:, and die Initial 
PID is s^to indicate an IP packet 

I.IJLI^ PPP 

If die PPP protocol type is IP, die PPP head^ is removed, and die Initial PIDS is set to indicate IP, 
odierwise» the PPP header is kiq>t, and die Initial PIDS is set to indicate PPP. The IPE OD is 
d^ennined by reading die sofiware connectioii table. 

1.1^1^ MPLS 

The top of stack shim label (in die AAL5 PDU) is rq)laced widi die VPI/Vd of die vhtual drcoit 
The VPWCI can be deduced from die LC QD. The IPE QD is determined by reading die software 
connection table. 

1. 1.2.2 Large Virtual Ciiddts 

Widi die exccpdaa of die following dianges, large virtual dtcuits are handled in die same way as 
small virtual circuits. 

The PI DFU control roisters can be programmed (by dse MPPU) widi the LC QD * s of iqp to 4 large 
virtual circuits. For diese circuits, if the padcet contains an IP header, die PI DFU TviUrepkcet^^ 
PPUID read from the hardware connection table with a LC PPUID lead from a hash table which is 
indexed by a hash of die source and destination JP addresses of the padcet (caknlated by the PI DFU) . 

Any of die entries (cizcoits) in die software VC ocmnection table can be made^ 
mult^le IPE FPUs. These are known as /iar^getcsm, and need iiot be die same virtu^ 
distributed by die DFU as explained above. For these drcoits, if the padoet contains an IP header, a 
new hash is calculated ovct the source and destination IP addresses of die padcet and used to select 
one of several Userlds destination Cardid, Destination PPUID, and IPE QD) diat are s^t to die IPE 
in the PIE Header of tbs padcet The most significant bits of die Destinatkin Flowld are still used to 
select die priority (class) of die padcet However, in the case of an IPsec packet addressed to the IPE 
card diat the packet will be forwarded to, die Userld is selected using a diCfeient means, as described 
in the IPsec protocol processing section bdow. 



Isl^ POSUneCard 

The following information is sent to die IPE FPU akmg widi the pad^ pai^oad: 
• hxfla&CSIX Header: 
• Destination Flowld: 
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Sent in the CSIX Header of every cell of the packet to identify where the switch febric 
should send it as well as the priority (class) of the pack^ 

• Intht Cell Header: 

• Source FlowI± 

Sent in the Ceil Header to allow the IPE to reassemble the packet This is siinply the 
identification of the line card (in tiie least significant 8 bits) and the priority (class) in die 
most significant two bits. The priority MUST be the same as is specified in the 
Destination Flowld of this packet 

• Discard Bit 

Set by the LC PM to mfoim the BPE that an error (IP checksum, internal parity) was 
detected in die packet 

• EOPBit 

S^ by die LC PM to indicate the last cell of die packet 

• ln1te PIE Header: 

• Initial PID: 

This 3 bit field tells die IPE the type of encapsulation dus padcet has. The choices are: 
IPC,IP,PPP,orMPLS 

• Initial Stage: 

This 4 bit field can be used to give additional information to die IPE about die 
encqisulation of diis packet It specifies which stage in die IPE PI will be die first to 
inspect die packet 

• Destination FPU (1 bit) : 

This bit is set for IP padcets whose destination address is equal to one of die IP addressee 

of die IPE card thatpadcet is beii% sent to. 

• Destination PPUID: 

The PPU identifier of die IPE PPU diat die padcet is being sent to. 

• IPECED: 

An index into die connection table of die IPE PPU diat die padcet is being sent to. 



Unless MPLS^PP/SONET is bemg used, each PPP/SONET PHY conq)rises a single user. Whcm 
MPLS is m use, however, each MPLS Labd Switched Padi OLSP) rqaesents an additional user. 

For POS Line Cards, die LC CID is shnpfy die PHYID. The PI DFU control registers can be 
programmed(bydieMPPU) widitiieLCOD'sofup to4PHYs, For diese PHYs, if die packet 
contains an IP header, die PI DFU will replace die LC PPUID read firom die hardware connection 
table widi a LC PPUID read fi:om a hash table which is indexed by a hash of die source and 
destination IP addresses of die packet (calculated by die PI DFU). This capabiH^ of die PI DFU mnst 
be used for OC-192c and CC-48c PHYs m order to distribute die load over multiple LC FPUs, For 
0C-12C and smaller PHYs, die PI DFU need not be used Instead, die PI uses die LC CID as mdex 
into die hardware connection table. Tbe PI reads (amongst odier tilings) a LC PPUID which selects 
die LC PPU tiiat die control information for die packet should be salt to. 

As packets are inflected by die LC, a detennmation of laiori^ (one of fw This 
priority is used to determine die two most significant bits of die Destination Howld when die pac^ 
is forwarded to an IPE. 

The following is a list of all die different types of top level protocol encqisulation diat can be received 
1^ the POS line card, along widi an eaqjlanation of die processing duit must take pla^ 

for eadi type of protocol: 

1 1,3.1 PPP Control Pfotocol (LCP, PAP, CHAP, IPCP, MPtSCP, etc^) 

This categoiy indudes not onfy PPP Control Protocols, but also any Netwodc Protocol odicr dian IP 
or MPLS. 
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The LC PPU uses the LC CID (which is really just the PHYID) as an index into a software PHY table. 
This table provides the Primary UserH which determines where the packet is sent as well as the BPE 
CID that is sent in the PIE Header of the packet For large and small PPP/SONET users, all non-IP 
and non-MPLS packets are sent to the IPE PPU identified by the Primary Userld, No distribution is 
perfoxmed for these packets. Also, Ae Initial PID is set to indicate a PPP packet 

1.1^.2 IP 

Tlie LC PPU uses tiie LC CED (whidi is really just the PHYID) to index into and read fi:<Hn the 
software PHY table. From this die LC detennines whether this is a small user or a large user. For 
5ifia//ttfers,tiie/VTO!aiy C/jer/d is also read ftc^ This detennines where the packet is 

sent as well as the IP£ COD diat is seittmflie P/E if eo^fer of Aepadc^ 

For large userSy however, a hash is calculated over the source and destination IP addresses of flie 
pad^et and used to select either the Primary UserlD or one of seveial Secondary UserlDs. The 
sdected UserlD detennines where the packet is sent as well as &e IPE CID that is sent in &e PIE 
Header of fbcpAdkisL 

The PPP header is removed before die pack^ is f<»warded to &e IPE, and die Initial PID is set to 
indicatse an IP padcet 

1.1^3 MPLS 

As is die case widilP/PPP/SOl^, die LC cm only identtOes the PHYID, Therefore, when die LC 
PI idoitifies an MPLS pad^ die top of stack label mnst be captured in order to identify die user. 
Foreai^POSPHY, dielX:PPUnuistinamtainatableofMPLSLSPs. The LC OD selects 
tableland die top of stadc label is used to index into die table. Tor small users^ibs Primary UserlD 
diatcorre^nds to the LSP can dien be read die table, Fbr faigcitrcrr, however, a smrilar process to 
die one described above for IP is used A hash is calcukted over die source and destination IP 
addresses of the packet and used to select eidier die Primary UserlD or one of several Secondary 
UserlDs, The selected a5cr/i)detemiines where die packet is sent as weU as die IPE cmd^^ 
in die FZE /feoi/er of die padcBt 

Ibe PPP header is removed before die padoet is forwarded to die IPE, and die Initial PID is set to 
indicate an MPLS padcet 

EibemetLbieCara 

The following information is sent to die IPE PPU along widi die padcet payload: 

• JniticCSDC Header: 

• Destination Flo wid: 

Sent in the CSDC Header of every cell of die packet to identify v/hete die switch fabric 
dmuld s^d it as well as die priority (class) of die padcet 

• hitiic CeU Header: 

• Source Flowld: 

Sent m the Ceil Header to allow die IPE to reassemble die packet This is singly the 
identification of the line card (in die least significant 8 bits) and die piionty (class) in the 
most significant two bits. The piiorify MUST be die same as is q[>ecified in die 
Destmation Flowld of diis padoet 
" • Discard Bit 

Set by die LC PM to infonn die IPE that am emt (IP chedcsum, internal parity) was 
detected in the packet 

• EOPBit: 

Set by the LC PM to indicate die last cell of die packet 

• Isiike PIE Header: 

• Initial PID: 
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This 3 bit field tells the IPE the type of encapsulation this packet has. The choices are: 
IPC, Edicmct, PPP, IP, or MPLS. 

• Initial Stage: 

This 4 bit field can be used to give additional information to the IP£ about tbt 
encapsulation of this packet It specifies which stage in ibG IPE PI will be the first to 
inspect the packet 

• Destination PPU (1 bit): 

This bit b set for IP packets whose destination address is equal to one of the IP addresses 
of the IPE card diat packet is bdng sent to. 

• Destination PPUID: 

The PPU identifier of die PE PPU that the packet is being sent to. 

• IPE ODD: 

An index into the connection table of the IPE PPU that (he pacioet is being sent to. 

Unless MPLS or PPPoE is being used over the Ethernet, eadi PHY comprises a single user. When 
MPLS or PPPoE is in use, however, each MPLS Label Switched Path (LSP) or PPPoE session 
v^resents an additional user. 

For Etii^et Lhie Cards, the LC OD is simpty tiie PHYID. The PI DFU control legisteis can be 
programmed (by the MPPU) with the LC CID*s of up to 4 PHVs. For these PHYs, if the padcet 
contams an IP header, flie PI DFU will iq)lace the LC PPUID read firom Ac hardware c<»inection 
table widi a LC PPUID read ficom a hash table which is indexed by a hash of the source and 
destination IP addresses of tiiepadcet (calculated by the PI DFU). This capability of die PI DFU must 
be nsed for 10 G^a^it Ethernet Cards in Older to distribute the load ova- multiple Fori 
Gigabit and smaller PHYs, the PI DFU need not be used. Instead, ^ PI uses LC OD as mdex 
into die hardware connection table. The PI reads (amongst other things) a LC PPUID wfaidi selects 
the LC PPU that the control information for the padcet should be sent to. 

Each Etiiemet PHY most have a globally unique Etiiemet MAC address permanent^ assigned to it 
All E&CTet packets are forwarded to tte IFE witii tiieir MAC headers intact, and the witii Ae Initial 
FIDS set to indicate Ethernet, except for MPLS and PPPoE session packets (edieitype 0x8864). 

As padcets are inspected by the LC, a determination of priority (one of four classes) is made. This 
priority Is used to detramme two most significant bits of tiie Destination Flowld when the padcet 
is forwarded to an IPE. 

The following is a list of all tfie different types of top levd protocol encapsulation Hxst can be received 
by tiie Ethernet line card, along with an explanation of the processing tiiat must tal^ place on tiie line 
card for each type of protocol: 



1.1^1 IP 

The LC PPU uses the LC CID (which is really just tfie PHYID) to index into and read ftom tiie 
software PHY table. From diis die LC determines whether this is a small user or a large user. For 
smaU users^^e Primary Userld is also le^dfnmfhc FEY t^^^ This detenniiies ^vviieie tiie padcet is 
sent as well as the IPE QD that is sent m tiie PIE Header of tiie padcet 

For large users, however, a hash is calculated over the source and destination IP addresses of die 
packet and used to select eidier die Primary UserlD or one of several Secondary UserlDs, The 
selected UserlD determines where the packet is sent as well as fte IP£ CID diat is sent in tfie PIE 
f/eoifcr of die packet 

The packet is forwarded to die IPE widi die Ethernet MAC header mtact, and the Initial PID is set to 
indicate an Ethernet padcet 
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PPPoE Session . m 

For PPPoE Session packets, the Ethernet and PPPoE headers are removed, and the PPPoE Session ID 
(from the PPPoE header) is used to index into a PPPoE session table, from which the Useild (IPE 
Cardid, IPE PPUID and IPE CID) can be retrieved. A unique PPPoE Session table can be maintained 
for each PHY, and Ac LC CID can be used to select whidi sessicm table to use. 

If PPP protocol type is IP, the PPP header is also removed, and Ac Initial PipSIs set to indicate 
IP, oflicrwise, the PPP header is k^t, and Ae Initial PIDS is set to indicate PPP. 

4 •1.4.3 MPLS 

The LCCro only identifies the PHYID that die padcet was received on. Therefore, when die LC PI 
identifies an MPI^ packet, &e top ofstack label must be captured in order to iden^ For 
each Ethernet PHY, the LC PPU must maintain a table of MPLS LSPs. The LC CID selects which 
table, and die top of stack label is used to mdex into die table. For mull MPLS users, die Prmary 
UserJD diat corresponds to the LSP can dien be read the table. For large users^ however, a smiilar 
process to the imedesCTibed above for IP is used. A hash is calculated ov«: die source and destn^tion 
IP addresses of the packet and used to select ddicr the Prmary Userm or one of several Secondary 
User IDs. The selected UserlD detenmnes where die padcet is sent as weU as die IPE CID fliat is sent 
in the PIE Header of the padcet 

The Efliemet header is removed before die packet is forwarded to die IPE, and die Initial FID is set to 
indicate an MPLS packet 

i.1.4.4 Other Ethernet Protocols (AW, PPPoE Discoveiy,et^ ^™^„e • 

This category inchides all EfliOTirt protocols (ediertypes) odicr dianlP. MPLS, and PPPoE Session. 

The LC PPU uses the LC CEO (which is realty just die PHYID) as an index into a software PHY table. 
This table provides die iVimoTy OserW which determm 

OD diat is sent in die P/£^ea</er of die packet For it^e and otio/Z E&emet iiscrr, dicse padcets 
are sent to die IPE PPU identified by die Primary Userld. No distribution is perfonned for these 
packets. Also, tiie Initial PID is set to mdicate an Etiiemet packet 

1.2 line Cards -Egress Side 

Line cards perform all the tcaf&c shaping for the system. 

1.211 ATM Une Card 

The following infonnation is received from IPE PPU along widi the packet payload: 

• In die CSDC Header : 

• Destination Flowld: . 

Sent in the CSIX Header of evoy cell of die packet to id«itify where die switdi febnc 
should send it as well as die priority (class) of the packet 

• In die Cell Header: 

• Source Flowld: . . u 

Sent in die Cell Header to allow die LC to reassemble die packet This is smelly the 
identification of die BPE card (in die least significant 8 bits) and die prioiity (class) in the 
most cignilicant two bits. The priority MUST be same as is specified in the 
Destination Flowld of diis padiet 

• DiscardBit • . v j . . , 

Set by die IPE PM to inform die LC that an eiror (mtemal parity) was d^ected m the 



• EOPBit: 

Set by die IPE PM to mdicate die last cdl of die packet 
In the PIE Header: 

• InitialPID: 



wo 01/67694 



PCT/USOl/01003 



43 

This 3 bit field tells the LC the type of encapsulation this packet has. The choices arc: 
IPC (for inter-processor communications), IP, Ethernet, PPP, or MPLS 

• Initial Stage: 

Always 0. 

• Destination PPU(1 bit): 

Always 0. 
« Destination PPUID: 

Tlie PPU identifier of &e LC PPU that &e packet is being sent to. 

• LCCID: 

This is an index into the connection table of tiie LC. 

TTie Destination PPUID selects the LC PPU that win process the packet: The LC OD is used by the 
LC PPU as an index into a software connection fable. This connection table provides die shaping 
parameters, any additional encapsulation that must be added by ^ LQ the PHYID. and die ATM 
VP Wa for tiie packet 

The priority (one of four classes) is based on die two most significant bits of die Source Howld in the 
CdSL Header. Tbe priocily is used by tbc Traffic diaper and 4ie Sdiedider to detemiine when to 
forward the packet to tiie PHY. 

The ATM cell headers arKl Ae AAL5 trailer and paddmg are alw^ added (by Ae PM) befiwe 
forwarding die padcet to die PHY card. 

The following is a list of all the di£faent types of top level protocol encapsulaticm diat can be received 
from an IPE by flie ATM hue card, along wifli an e:q>]anation of die processmg Aat must take place 
on the line card for eadi type of pxotocob 

1J2a1 I**! IP 

The desired enc^)Sulation for die padcet can be eiflier IP/PPP/PPPoE^cmet/ATM. IP/PPP/ATM or 
IP/ATM The PPU can dctennine which it is from die connection table. If die encapsulation should 
be IP/PPP/PPPoE/Ediemet/ATM, die connecticm table will provide the necessary information to add 
the missing headers. If die oicapsulation should be IP/PPP/ATM, a PPP header identifying die 
protocol as IP is added. Also^ die entry in the connection table may specify fbzt an LLC header 
should also be added. 

^JZAAJZ Ethernet , . . . . . 

The connection table may specify diat an LLC header should be added to the beginnmg of the pa<aceL 
Odierwise die padoet is sent as is. 

1J2.1.1.3 PPP 

The desired cnc^)sulation may be citiier PPP/PPPoE/Ediem^ATM or PPP/ATM The PPU can 
determine i^ch it is from the connectim table. If it is PPP/ATM, tiie packet is sent as is, odierwise, 
die connection table wifl provide the necessary information to adi a PPPoE header and an Ediemet 
Header. 

MPLS 

As widi all odier encapsulations, die proper VPI/VO is obtained from the connection table. 



1.2.2 POS Line Card 

The following information is received from IP£ PPU along widi die packet payload: 
• In die CSJX Header: 
• Destination Flowld: 

Sent in the CSIX Header of evay cell of die padcet to identify where die switch fabric 
should send it as well as die priority (class) of the padcet 
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• In the Cell Header: 

• • Source Flowld: 

Sent in die Cell Header to aDow die LC to reassemble the packet This is simply the 
identification of the IPE card (in die least significant 8 bits) and the priority (class) in the 
most significant two bits. The priority MUST be the same as is spedfied in the 
Destination Flowld of this packet 

• Discard Bit 

Set by the IPE PM to inform the LC that an eiror (internal parity) was detected in the 
packet 

• EOPBit 

Set by die IPE PM to indicate the last cell of die packet 

• In ihe PIE Header: 

• Initial PID: 

This 3 bit field tells die LC die type of encapsulation diis packet has. The choices are: 
IPC (for inter-processor aHmnmucations), IP, PPP, or MPLS 

• Initial Stage: 

Always 0. 

• Destination PPU (1 bit): 

Always 0. 

• Destination PPUID: 

Tl^ PPU idoodfier of die LC PPU diat the packet is bemg sent to. 

• LCQD: 

This is an index into die c(xmectioa table of die LC. 

The Destination PPUK) selects die LC PPU diat will process die packet The LC OD is used by die 
LC PPU as an index into a software connection table. This connection table provides die shqiing 
patamet^ anddiePHYID for die packet 

The following is a list of all die different types of top level protocol encapsolation that can be received 
by tfaePOS hue card, along widi an explanation of die inocessmg diat must take place on die Hue card 
for eadi tyjpt of protocol: 

1Jt2.1 PPP 

No additional processing is needed. 

^JZJZJZ IP 

A PPP header identifying die packet as an IP packet is added. 

iJ2^3 MPLS 

A PPP header identifying die packet as a MPLS packet is added. 



1.Z3 Ethernet Une Card 

The foQowmg mfonnaticm is received fiom IPE PPU along widi die packet payload: 

• In the CSIX Header: 

• Destination Flowld: 

Sent in the CSIX Header of every cell of die packet to identify where die switch fabric 
should said it as well as die priority (class) of die packet 

• InihsiCeB Header: 

• Source Flowld: 

Sent in the CeU Header to allow die LC to reassemble die pad^ This is sm^ly the 
identification of die IPE card (in die least significant 8 bits) and die priority (class) in die 
most significant two bits. The priority MUST be die same as is qiecified in the 
Destination Flowld of diis packet 

• DiscaidBit: 
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Set by the IPE PM to infonn the LC that an error (internal parity) was detected in the 
packet. 

• EOP Bit 

Set by the IPE PM to indicate the last cell of &e packet 
• hiUxt PIE Header: 

• Initial PID: 

This 3 bit field teUs the LC the type of encapsulation diis padki^ has. The choices aie: 
IPC (for inter-processor conununications), Edieinet, PPP^ IP, or MPLS 

• Initial Stage: 

Always 0. 

• Destination PPU(1 bit): 

Always 0. 

• Destination PPUID: 

The PPU identifier of fiie LC PPU that packet is being sent to. 

• LCCID: 

This is an index into tibe connection table of &e LC. 

The Destination PPUID selects the LC PPU diat will process ^epadcet The LC QD is used by die 
LC PPU as an index into a software connection table. This ccHinection table provides ^ shiq[>ing 
parameters, and &e PHYID for the packet 

The following is a list of all die diff^ent types of top level protocol oicapsalation diat can be received 
by die Edionet line card, along wifli an explanation of iSoR prooessii^ that must take place on die line 
. card for each type of protocol: 

Ethernet 

No additional processing is needed. All IP/Ethernet are sent using this type because die IPE, not the 
LC inq>Iements ARP, and dierefore adds die Bdiemet heado* to all IP padcds befm sending fton to 
dieLC. 

1JL3^ PPP 

Tbe desired encapsulation is PPP/PPPoE/Ediemet. The connection table provides the necessary 
infinmation to add a PPP6E header and an Ediemet header. 

1:2.3.3 IP 

The desired encapsulation is IP/PPP/PPPoE/Ediemet A PPP header mdicating an P packet is added. 
The connection table then provides the necessary information to add a PPPoE header and an Ethernet 
header. 

1.2.3.4 MPLS 

The connection table provides the information needed to add an Bdiem^ header (die destination 
MAC address is all that is required from die connection table). 



1.3 IPE Card 

IPE Ingress Protocol 

All packets received by an IPE card from the Line Cards (or from odier IPEs) will be of one of the 
following types. The Initial PID field in the PIE Header will identify which one of diese types each 
packet corresponds to. If diere are more than 8 such types, the Initial Stage field in die PIE Header 
can be used to select a different stage to b^m uiq)ect]on, each of which allows 8 additional protocols 
to be identified by die InmcA PID field. 

The IPE CID and PPUID in die PIE Header of die received paicket combine widi die Fiowld to give 
dieUserld. Onlydieleastsignificant 18ofdie20bitsof dielPECIDarensed. 
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1^.1.1 IPC 

These packets are used for inter-processor communication within the system. The PI should be 
programmed to capture these packets to a PPU (as specified in the Destination PPU field in Ae PIE 
Header) in their entirety. 

1^.1 IP 

These packets consist of only an IP packet That is, an IP header, followed by an IP payload (which 
might include TCP, UDP, ICMP, etc.). This category does not include IP packets received over 
MPLS or over E^exnet 

These packets can come firom either a POS LC, an ATM LC, or an Ethernet LC The possible 
enc^isulations that could result in such a packet are: IP/ATM, IP/PPP/ATM, IP/PPP/SONET, 
IP/PPP/PPPoE/Ethemet, andIP/PPP/PPPoE/E&emet/ATN4. 

The IPE CID uniquely identifies the PPPoE Session ID, or the ATM Virtual Orcuit that die packet 
was received on as well as the PHYTLC diat it was received on. In the case of IP/PPP/SONET, die 
IPE CID win identify only die PHY/LC that die packet was recehred <m, 
all IP/PPP/SONET packets leceived fiwrn a particular PHY/LC 



1.3.1.3 PPP 

This catBgoiy consists of all PPP packets received whose PPP protocol type was not IP or MPLS. 
These pack^ can come fimm a POS an ATM LC; or an Ethernet LC For diose PPP sessions diat 
will be tunneled using LOT. die IPE must add a new PPP header to the IP/PPP and MPLSflPPP 
packets, since for diose protocols, Ac PPP header will have be«i removed by die Line Card. 

The possible encapsulations diat could result in such a packet are: PPP/SONET, PPP/PPPoE/Ediemet, 
PPP/ATM. and PPP/PPPoE/EfliOTiel/ATM. 

The IPE CID uniquely identifies die PPPoE Session ID, or die ATM WhUisl Circuit diat die padcet 
was received on as wen as die PHY/LC fliat it was received on. In die case of PPP/SONET, flie IPE 
CID win identify onfy die PHY/LC diat die packet was received «!, that is, ^ 
PPP/SONET padcets received ftom a particular PHY/LC. 

1^.1^ Ethernet 

This category consists of aU Native Ediemet or Ediemct/ATM packets received except fiir MPLS 
(edi^type = Ox????) and PPPoE data packets (ediertype = 0x8864). This category also mchides 
packets whose destination MAC address is not equal to die MAC address of die PHY on which die 
packet was received (broadcast and multicast padcets, and unicast padcets if in pnmdscuous mode). 

The possible encapsulations diat could result in such a paclrct are: ARP/Ediemet, IP/Ediomet, PPPoE 
Discoveiy/Ediemet, ARP/Ediemet/ATM, IP/Ediemet/ATM. and PPPoE Discovery/Ediemet/ATM. 

For Ediemet/ATM, die IPE CID uniquely identifies die ATM Virtual Circuit diat die packet was 
received on as well as die PHY/LC that it was received OIL In the case of Native Ediemet, the IPE 
CID will identify onfy die PHY/LC that die padnt was received on, that is, it will be constant for all 
padcets received from a particular PHY/LC. 

1^.1^ MPLS 

This category consists of packets whidi b^in widi an MPLS label stack. These. can come nom a 
POS LC, an ATM LC or an Ediemet LC 

The possible encq)Sulations diat could resuh in sudi a packet are: MPLS/PPP/SONET, 
MPLS/Ediemet, and MPLS/AIM. Indiecaseof MPLS/AIM, die line Card wOlhave leplaoed die 
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top of stack shim label with the real label because the real label was encoded as die ATM VFI/VQ in 
die packet received from die network. 

The following encapsulations are NOT supported: MPLS/PPP/PPPoE/Ediemet. MPLS/PPP/ATM, 
MPLS/PPP/PPPoE/Etiiemet/ATM, and MPLS/Ediemct/ATM. 

The IPE CID uniquely identifies die same as die top of stack incommg top of stack MPLS label, as 
well as die PHY/LC diat it was received on. In the case of MPLS/ATM die top of stack label has a 
one to one correspondence widi die ATM Virtual Circuit diat die packet was received on. 

1.5.2 IPE mgtess Protocol Decoding 

The following table shows the first two layers of protocols diat must be identified by die PI on the IPE 
for each packet diat passes dirough it 



IP 




Ethernet 


IP 


ARP 


PPPoE Discovery 


MPLS 


IP 


PPP 


Control Protocols 



13^1 IP Packets 

AD IP packets received by die IPE, v^^edier sdll encapsulated widi Ediemet, widi MPLS, or widiout 
encapsulation, will fisill into one of two categories: diose for which die destination IP address is equal 
to one of die addresses of die IPE, and diose for which it isn't In die case of die latt^, die pad^et 
must be forwarded or discarded by the PPU. Butfordiefoinier,itinustbedetemiinedwhcdicr oriiot 
die packet can be processed entirely by die PPU, or whedier it must be sent to die MPPU for fiirdier 
processii^ If it must be sent to die MPPU, it must be captured in its entirety. 

All IP packets received, i^ardless of dieir encapsulation, must have dieir destination IP address 
cq>tared and exammed All routii^ table seardies are performed by die IPE cards. If the destination 
address is one of die system's IP addresses, but not one of die IPE card's addresses, die packet must 
be forwarded widi Destination PPU bit set 



1.3.2^ L2TP Tunnels . . ^ 

Each L2TP tunnel is handled entirely by a particular IPE card. Each session widun the tunnel must be 
handled aitirely by a particular PPU. This requirement conies primarily from die need to siqiport 
sequence numbers on the data sessions: 

RF02661: ''Each peer maintains separate sequence numbers for ihe control connection 
and each individual data session within a tunnel " 

Therefore, lar^ PPP users cannot be tunneled. All L2TP control packets are forwarded to and 
piocessed by die MPPU of die IPE card. 

1 .3«2^1 t2TP Access Concenteator (LAC) 

Any PPP user can be sdected f<tf L2TP tunneling by die IPE MPPU. If a user is selected for 
tunneling, dien die PPU receiving PPP packets firom diat user must encapsulate diose padcets, first 
widi an L2TP header, tiien a UDP header, and finally an IP header. The IP header's destination 
address will be diat of the configured LNS, and die source address will be one of die IP addresses of 
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IPE. The resulting BP packet can then be forwarded using the standard IP forwarding procedure to the 
appropriate Line Card for transmission. It should be evident that tunneled PPP users on different IPE 
cards will be placed in sq>arate tunnels even if being tunneled to the same destination LNS. 

IP packets received ftom the LNS will be sent by the receiving Line Card to ihc IPE PPU associated 
with the ingress interfece {user). This PPU may well be on a different IPE card than the one handling 
thetumiel. This is easily drtermined ftom the destination IP address of the packet In diis case, the 
PPU receiving tiie packet from the Line Card must forward the packet to the IPE card handling the 
tunnel In addition, the L2TP Session H) can be used to identify which PPU on that IPE caid should 
receive the packet (this PPUID must be sent in tfie PIE Header so that tiie receiving PI will know 
which PPU should receive tiic packet). This is done by always caicodmg the PPUID of the PPU 
li^ pHling a particular session in the most significant four bits of the L2TP Session ID. 

RFC-2661: "Since L2TP sessions are named by iden^ers that have local significance 
only. That is, the same session will be given different Session Ids by each end of the 
session. Session ID in each message is that of the intended recipient, not the sender. " 

The PPU to whidi the padcet is sent to can in turn can de-encapsulate the PPP packet and forward it 
to OePPP u5£ridaitifi£dbytiieL2TP session ID. 

1.3.2JL2 L2TP Network Server (LNS) 

When functioning as an LNS, L2TP packets received from the LAC will be forwarded, either by a 
line Card or anodier IPE, to &e IPE handling die tunnel This is because die destination IP address of 
tiiepacketwfflbeeqqaltooneofflielPaddressesoftheffEhandln^ftetunneL WiteitiiatIPE,tiie 
PPU tiiat should process tiie L2TP session is identified using tiie most significant four bits of the 
L2TP Session ID. The PPU will de-encapsulate the PPP packet, then process tiie PPP packet as if it 
was leceived frcm a PPP user. From Ais point on, die processing is die same as for a "leaT PPP 
user. 

In the other direction, packets which, when flicir destination IP address is looked up in the routing 
table, yield a destmation PPP user tiiat is associated wifli a LOT tunnel instead of with a Line Card, 
must be sent to tiie IPE PPU handling tiie PPP ojer. This is because of the sequence number 
requirement of L2TP mentioned above. Once received tiiis PPU, the packet must have a PPP header 
added, as is the case with a normal PPP user. Tlien, instead of forwanhng the packet to a Line Card, a 
L2TP header is added, followed by a UDP header and an IP header. The IP destination address is tiiat 
oftiieLACat&eoflierendoftiietnnneL Tbe lesuMng IP packet can then be forwarded using the 
standard IP forwarding procedure to the appropriate Une Card for tzansmission. 



1^J2.3 IPSec Tunnels ^ ^ . . 

Eadi IPSec Secori^ Association (SA) is handled ^itirely by a particular IPE PPU. As defined m 
RFC-2401, a Security Association is a unidirectional, "shnplcx" connection that provides security 
services to die traffic carried by it 

1 ^.2^1 Inbound IPsec processing 
• Plain padcets 

Every PPU must have a copy of the SPD for every user from which it receives packets. In other 
words, for eveiy UserlD {Primary oi Secondary) tiiat pomts to a particular IPE PPU, the PPU must 
have a pointer to an SPD. If a user's traffic is split among multq)le PPUs (i.e.: a large user), Acn tiiey 
should have identical SPDs configured for die icser, and each will create its own set of Security 
Associations for its share of the urer^s traffic. Evory packet leceived must be i»ocessed usiiig die 
SPD of die user die jpacket is received fiom. 
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• Tunneled packets 

The SPI is the field in the IPSec header that, along witfi the destination IP address, identifies the SA. 
Traffic ftom a small user will always be directed by receiving Line Card to a particular FPU. This 
PPU uses the SPI to identify the SA, and thus has access to the infonnation it needs to decapsulate Ae 
packet For large users, however, the Line Caid must detect IPsec packets whose IP destination 
address is one of die addresses that belongs to the IPE card identified by the tsser's Primary Userld. 
Radier tiban select a Userld {primary or secondary) based on the hash of Ae source and destination IP 
addresses of the packet, die LC nnist use the SPI in the IPsec header to select &e Userld, and thus the 
IPE PPU, to send tiie packet to. In order to accomplish this, die most significant 4 bits of an SPI 
always contain die PPUID identifying die PPU that is handling die S A identified by that SPL 

1^2^.2 Outbound IPsec processiiis 

Since BPScc peifonns tunneling at Layer 3, aitire users don't get tunneled. Rather, each packet about 
to be sent to a user is individually examined using die Security Policy Database (SPD) associated with 
diat user, fi^om which a pointer to a S A (in the SAD also associated wilfa die user) is obtained 

Tlie difiQculty widi oudx)und processing is that, as discussed eadier, die configuration information 
(and tims die SPD) associated with the egress user is not readily available. The informati<m must be 
requested fiom the PPU identified by the /Vwioo^ {/^er/J and stored in a cache. Each PPU sending 
to a user will ftms create its own set of Seoni^ Associations. 



1.3,3 n^E Packet l^rwanling and EgtessPpoces^ng 

The IPE card PPUs performs routing table searches for all padcets Aat'need forwarding. The global 
Forwarding Infonnation Base (FIB) is distributed to every PPU in die system, and contains IP unicast 
and multicast routing tables in a form that facilitates longest matchrag prefix searches (Le.: Patricia 
tries), as well as tables required for MPLS label based forwarding. 

One of the results of every routing table lookup is die Primary Userld identifying die layer 2 iut^ce 
by which die packet should be transmitted. It is mq)ortant to note diat die /Vinui?^ C/^er^^ 
same as die L CUserld, and does not direcdy give die Cardid of die Lme Card where die packet should 
beforwaided Radier, die Primary Userld identifies the IPE PPU that maintains die configuration and 
state information for die user. 

This presents a coirqilication because die IPE diat is trying to forward die padcet needs die 
infonnation stored on die IPE PPU identified by die /VwiflO't^J^^ Radier dian simply forward flic 
packet to die odier IPE for egress processing, which would result in additional latency and switch 
fabric bandwidth utilization for every packet, it sends a message to tj^e PPU identified by die Primary 
Userld^ requesting a copy of die user *s configuration. This information is kept in a user configuration 
cadie and is used for all subsequent packets directed to die same user, AH counters and statistics fliat 
need to be maintained for each user must also be maintained for eadi cached user, and must also be 
periodically sent to the PPU identified by die Primary Userld 

This process makes it difficult to implement such functi<MiaUty as per-uSCT traffic shaping in die IPE 
PPU, because die processing would need to be distributed among a potentially large nmnber of 
processors. Therefore, traffic shaping is to be inqiilemraited stricfly on die Une Card using die ^;ress 
PPUs. 

One of the fields diat is acquired and cached as part of the user configuration infonnation is the 
LCUserld. This field contains the Cardid of die Line Card diat die packet must be forwarded to, as 
wcU as die PPUID and QD dmt shouM be sent in die PIE header of die padcet to that Uto 
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What is claimed is: 

1. A packet processing circuit comprising: 

a packet inspector for examining a stream of cells 
to determine control information for packets represented 
thereby; 

5 at least one buffer access controller connected to 

said packet inspector for storing at least a portion of 
data cells received from said packet inspector, and for 
processing control information received from said packet 
inspector to produce additional control information; and 
10 a packet manager connected to said buffer access 

controller for receiving control information therefrom 
for use in formatting packets corresponding to said 
control information. 

2. The circuit of claim 1 wherein said packet 
manager is configured for using the control information 
received from said buffer access controller to reassemble 
said corresponding packets. 

3. The circuit of claim 2 wherein said packet 
manager is connected to said packet inspector for 
coordinating the dequeuing of data cells representing 
said corresponding packets from said buffer access 

5 controller. 

4. The circuit of claim 1 further comprising a cell 
buffer associated with said buffer access controller for 
storing said data cells. 

5. The circuit of claim 4 further comprising at 
least one protocol processing unit associated' with said 
buffer access controller for processing said control 
information received from said packet inspector. 

6. The circuit of claim 5 wherein said protocol 
processing unit comprises at least one general purpose 
processor unit. 
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?• The circuit of claim 5 further comprising an 
additional buffer access controller connected to said 
packet inspector, wherein said buffer access controllers . 
are configured for storing different portions of data 
5 cells received from said packet inspector. 

8. The circuit of claim 7 further comprising a 
protocol processing unit associated with said additional 
buffer access controller, and wherein said buffer access 
controllers are each configured for determining whether 

5 to forward certain control information received from said 
packet inspector to its associated protocol processing 
unit for processing. 

9. The circuit of claim 8 further comprising a 
master processing unit connected to said protocol 
processing units for providing said protocol processing 
units with configuration data. 

10. The circuit of claim 9 further comprising a 
switch, wherein said master processing unit and said 
protocol processing units are interconnected to one 
another through said switch. 

11. The circuit of claim 7 wherein each buffer 
access controller has at least two protocol processing 
units associated therewith. 

12. A mid-network server comprising: 

an input for receiving a packet delivered thereto; - 
a line module connected to said input for receiving 
said packet; 

5 a plurality of processing modules for performing 

mid-network processing functions; and 

a switch fabric connected to said line module and 
said processing modules for delivering packets 
therebetween, wherein said processing modules are at 
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10 least substantially identical to one another and 
independently programmable - 

13. The server of claim 12 further comprising an 
additional line module, wherein said line modules are at 
least substantially identical to one another and 
independently programmable. 

14. The server of claim 12 wherein said processing 
modules are each configured to support a plurality of 
packet types, and each line module is configured for 
formatting a packet into one of said types prior to 

5 sending said packet through said switch fabric to one of 
said processing modules. 

15. The server of claim 12 wherein said line module 
and said processing modules each comprise a packet 
inspector, a packet manager, and at least one buffer 
access controller. 

16. The server of claim 15 wherein said line module 
and said processing modules each comprise a plurality of 
buffer access controllers interconnected with said packet 
inspector and said packet manager. 

17. The server of claim 16 wherein each of said 
buffer access controllers have at least one protocol 
processing unit associated therewith. 

18. The server of claim 17 wherein each protocol 
processing unit is in communication with at least one 
other protocol processing unit on the same module. 

19. The server of claim 18 wherein said line module 
and said processing modules each comprise a master 
protocol processing unit for controlling the protocol 
processing units on that module. 

20. The server of claim 19 wherein said line module 
and said processing modules each comprise an Ethernet 
switch for interconnecting the master protocol processing 
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unit with said other protocol processing units on that 
5 module - 

21. A packet server comprising: 

an input for receiving a packet delivered thereto ; 
a line module connected to said input for receiving 
said packet; 

5 a plurality of processing modules for performing 

packet routing functions; and 

a switch fabric connected to. said line module and 
said processing modules for delivering packets 
therebetween, wherein said line module is configurable to 

10 send said packet to any one of said processing modules 
through said switch fabric, and said processing modules 
are each configurable to perfoinn said routing functions 
for said packet if said packet is sent thereto by said 
line module. 

22- The server of claim 21 wherein said line module 
supports a plurality of user interfaces and is configured 
to send said packet to one of said processing modules 
according to the user interface through which said packet 
5 arrives at said server . 

23. The server of claim 22 wherein each processing 
module includes a plurality of processing units ^ and said 
line module is configured to send said packet to one of 
said processing units of one of said processing modules 

5 according to the user interface through which said packet 
arrives at said server. 

24. The server of claim 23 wherein said processing 
modules are each configured to support a plurality of 
packet types, and said line module is configured for 
formatting said packet into one of said types prior to 

5 sending said packet through said switch fabric to one of 
said processing modules. 
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25. The server of claim 21 wherein said line module 
is connected to said input through a phy module. 

26. The server of claim 21 wherein said line modul-e 
and said processing modules each include a plurality of 
general purpose processing units. 

27. The server of claim 21 wherein said line module 
and said processing modules can be programmed to support 
any type of transmission protocol. 

28. A packet server comprising: 

a plurality of line modules for receiving packets 
delivered to said server over a physical connection; 

at least one processing module for performing packet 
5 routing functions; and 

a switch fabric connected to said line modules and 
said processing module for delivering packets 
therebetween, wherein each line module is configured to 
format a packet into one of a plurality of types prior to 
10 sending said packet through said switch fabric to said 
processing module, and said processing module is 
configured to support each of said packet types. 

29. The server of claim 28 wherein said processing 
module includes a plurality of processing units. 

30. The server of claim 29 wherein each line card 
supports a plurality of users and is configured to assign 
each user to one of the processing units of said 
processing module. 

31. The server of claim 30 wherein at least one of 
the processing units of said processing module is 
assigned to a first user supported by a first one of said 
line modules and a second user supported by a second one 

5 of said line modules. 

32. A method for processing packets within a 
server, said method comprising the steps of: 
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converting a packet input to said server into a 
stream of fixed length cells; 
5 processing said stream of fixed length cells using a 

line module to format said packet into one of a plurality 
of protocol types; and 

sending said formatted packet to a processing module 
configured to support each of said plurality of protocol 
10 types - 

33. The method of claim 32 wherein said processing 
step includes reassembling said packet. 

34. The method of claim 33 wherein said processing 
step further includes examining said cell stream to 
obtain control information for said packet. 

35. The method of claim 34 wherein said control 
information includes information identifying a particular 
processing module for further processing said packet. 

36. The method of claim 34 wherein said processing 
step further includes processing said control information 
to produce additional control information for use in 
reassembling and formatting said packet. 

37. The method of claim 36 wherein said processing 
step further includes identifying a particular processing 
module to which said packet should be sent. 

38. The method of claim 37 wherein the sending step 
includes sending said packet to said particular 
processing module identified in said processing step. 

39. The method of claim 37 wherein said identifying 
step includes identifying a particular protocol 
processing unit on said particular processing module for 
processing control infoinnation corresponding to said 

5 packet. 
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40. The method of claim 33 further comprising the 
step of performing mid-network processing functions on 
said sent packet using said processing module. 

41. The method of claim 40 wherein the step of 
performing mid-network processing functions includes 
formatting said packet for its destination interface. 

42. The method of claim 41 further comprising the 
step of sending the packet formatted by said processing 
module to a line module corresponding to said destination 
interface. 

43. The method of claim 32 wherein said sending 
step includes sending said packet through a switch 
fabric. 

44. The method of claim 32 wherein said input 
packet is a packet represented by a plurality of fixed 
length cells. 

45. The method of claim 44 wherein the converting 
step includes modifying the length of said input cells. 

4 6. A method for processing packets within a 
server, said method comprising the steps of: 

converting a packet input to said server into a 
stream of fixed length cells; 
5 processing said stream of fixed length cells using a 

line module to format said packet into one of a plurality 
of protocol types; and 

sending said formatted packet to another line module 
configured to support each of said plurality of protocol 
10 types. 

47. A method for processing packets within a 
server, said method comprising the steps of: 

converting a packet input to said server into a 
stream of fixed length cells; 
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5 processing said stream of fixed length cells using a 

line module to format said packet into one of a plurality 
of protocol types; 

sending said formatted packet to a processing module 
configured to support each of said plurality of protocol 
10 types; and 

processing said stream of fixed length cells in said 
processing module; 

refomatting said fomatted packet into one of a 
plurality of protocol types; and 
15 sending said reformatted packet to another 

processing module. 
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